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TARGETED MODIFICATION OF CHROMATIN STRUCTURE 

TECHNICAL FIELD 
5 The present disclosure is in the fields of chromatin structure and genetic 

regulation, in particular, the modification of chromatin structure to facilitate interaction 
of molecules with a region of interest in cellular chromatin. 

BACKGROUND 

10 Regulation of gene expression m a cell is generally mediated by sequence-specific 

binding of gene regulatory molecules, often proteins, to chromosomal DNA. Regulatory 
proteins can effect either positive or negative regulation of gene expression. Generally, a 
regulatory protein will exhibit preference for binding to a particular binding sequence, or 
target site. Target sites for many regulatory proteins (and other molecules) are known or 

1 5 can be determined by one of skill in the art. 

Despite advances in the selection and design of sequence-specific DNA binding 
gene regulatory proteins, their application to the regulation of an endogenous cellular 
gene can, in some cases, be limited if their access to the target site is restricted in the cell. 
Possible sources of restricted access could be related to one or more aspects of the 

20 chromatin structure of the gene. Access can be influenced by the stmcture of the gene 
per se {e.g., nucleotide methylation) or by the structure of the chromosomal domain in 
which the gene resides. 

Cellular DNA, including the cellular genome, generally exists in the fomi of 
chromatin, a complex comprising nucleic acid and protein. Indeed, most cellular RNAs 

25 also exist in the form of nucleoprotein complexes. The nucleoprotein structure of 

chromatin has been the subject of extensive research, as is known to those of sMIl^iii the 
art. Ill general, chromosomal DNA is packaged into nucleosomes. A nucleosome 
comprises a core and a linker. The nucleosome core comprises an octamer of core 
histones (two each of H2A, H2B, H3 and H4) around which is wrapped approximately 

30 150 base pairs of chromosomal DNA. In addition, a linker DNA segment of 

approximately 50 base pairs is associated with linker histone HI (or a related linker 

1 
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histone in certain specialized cells). Nucleosomes are organized into a higher-order 
chromatin fiber (sometimes denoted a "solenoid" or a 30 nm fiber) and chromatin fibers 
are organized into chromosomes. See, for example, Wolffe "Chromatin: Stnicture and 
Fmiction" 3"^ Ed,, Academic Press, San Diego, 1998 and Komberg et al (1999) Cell 
5 98:285-294. 

Chromatin structure is not static, but is subject to modification by processes 
collectively known as chromatin remodeling. Chromatin remodeling can serve, for 
example, to remove nucleosomes from a region of DNA, move nucleosomes from one 
region of DNA to another, change the spacing between nucleosomes or add nucleosomes 
10 to a region of DNA in the chromosome. Chromatin remodeling can also result in changes 
m higher order stmcture, thereby influencing the balance between transcriptionally active 
chromatin (open chromatin or euchromatin) and transcriptionally inactive chromatin 
(closed chromatin or heterochromatm). 

Chromosomal proteins are subject to numerous types of chemical modification, 
15 some or all of which influence chromatin structure. For example, histones are subject to 
acetylation by histone acetyltransferases, deacetylation by histone deacetylases, 
melhylation by histone methyltransferases (and therefore presumably to demethylation by 
histone demethylases), ubiquitination by ubiquitin ligases, de-ubiquitination by ubiquitin 
hydrolases, phosphorylation by histone kinases, dephosphorylation by histone 
20 phosphatases, and reversible ADP-ribosylation by poly-ADP ribose polymerase (PARP, 
also known as TFIIC). Strahl et al (2000) Nature 403:41-45. Regulation of chromatin 
structure by methylation of histone H3 has been described. Rea et al (2000) Nature 
406:593-599. Modifications of non-histone chromosomal proteins include, for example, 
acetylation of HMG-1 (Munshi et al (1998) Mol Cell 2:457-467); HMGs 14 and 17 
25 (Sterner et al (1981) J. Biol Chem, 256:8892-8895; Herrera et al (1999) Mol Cell 
Biol 19:3466-3473; Bergel etal (2000) y. Biol Chem. 275:11,514-11,520) and 
chromathi-resident transcriptional regulators such as, for example, TFHE (Imhof al 
(1997) Curr. Biol 7:689-692), p53 (Gu et al (1997) Cell 90:595-606) and GATA-1 
(Boyes et al (1998) Nature 396:594-598). Chemical modification of histone and/or non- 
30 histone proteins is often a step in the chromatin remodeling process, and can have eittier 
positive or negative effects on gene e3q)ression. Generally, histone acetylation is 
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correlated with gene activation; while deacetylation of histones is correlated with gene 
repression. 

A number of enzymes capable of chemical modification of histones have been 
described and partially characterized. For example, histone acetyl transferases include 
5 GrcnSp, p300/CBP-associated factor (P/CAF), p300, CREB-binding protein (CBP), 
HATl, TFIID-associated factor 250 (TAFii250), aad steroid receptor coactivator-1 
(SRC-1). Wade et al (1997) Trends Biochem. Set 22:128432; Kouzarides (1999) Curr. 
Opin. Genet. Devel 9:40-48; Sterner et al (2000) Microbiol Mol Biol Rev. 64:435- 
459. The HDAC family of proteins have been identified as histone deacetylases and 

10 inclnde homologues to the budding yeast histone deacetylase RPD3 (e,g. , HDACl, 

HDAC2, HDAC3 and HDAC8) and homologues to the budding yeast histone deacetylase 
HDAI (e,g., HDAC4, HDAC5, HDAC6 and HDAC7). Ng et al (2000) Trends Biochem. 
Set 25: 121-126. The Rsk-2 (RKS90) kinase has been identified as a histone kinase, 
Sassone-Corsi et al (1999) Science 285:886-891 . A histone methyltransferase 

1 5 (CARM-1) has also been identified. Chen et al (1999) Science 284:2174-2177. 

Effects of alterations in chromatin structure upon gene expression have been 
reported or mferred. Fryer a/. (1998) iVa/wre 393:88-91; andKehlee^ a/. (1998) 
Science 282:1897-1900. 

Because of the dynamic structure of cellular chromatin, the ability of a regulatory 

20 molecule to bind its target site in a chromosome may be limited, in certain circumstances, 
by chromatin structure. For example, if a target site is present in "open" chromatin 
(generally thought of as nucleo some- free or having an altered nucleosomal conformation 
compared to bulk chromatin) structural barriers to the binding of a regulatory molecule to 
its target site are unhkely. By contrast, if a target site is present in "closed" chromatin 

25 {i.e. having extensive higher-order structure and/or close nucleosome spacing), steric 

barriers to binding are likely to exist. Thus, the ability of a regulatory molecule to bind to 
a target site in cellular chromatin will depend on the structure of the chromatin 
surrounding tlmt particular target site. The chromatin structure of a particular gene can 
vary depending on, for example, cell type and/or developmental stage. For this reason, 

30 the regulation of a given gene in a particular cell can be influenced not only by the 
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presence or absence of gene regulatory factors, but also by the chromatin structure of the 
gene. 

Remodeling of chromatin can lead to activation of gene expression in vitro. For 
example, the NURF chromatin remodeling complex stimulates the transcriptional 
5 activation activity of the GAGA transcription factor. Tsukiyama et al. (1995) Cell 
83 : 1 0 1 1 - 1 020. Transcriptional activation by a GAL4- VP 1 6 fusion requires the RSF 
chromatin remodeling complex. LeRoy et al (1998) Science 282:1900-1904. The 
S WI/SNF chromatin remodeling complex potentiates transcriptional activation by the 
VP 16 activation domain and by ligand-bound glucocorticoid receptor. Neely et al 

10 (1999) MoL Cell 4:649-655; Wallberg et al (2000) Mol Cell Biol 20:2004-2013. 

There are also several examples of a requirement for the activity of chromatm 
remodeling complexes for gene activation in vivo. The human SWI/SNF chromatin 
remodeling complex is required for the activity of the glucocorticoid receptor. Fryer et 
al (1998) Nature 393:88-91. The manmiahan SWI/SNF chromatin remodeling complex 

15 is required for activation of the hsp70 gene, de La Sema et al (2000) Mol Cell Biol 

20:2839-285 1 . Mutations in the Drosophila IS WI protein adversely affect expression of 
the engrailed and Ultrabithorax genes, Deuring et al (2000) Mol Cell 5:355-365. 
Finally, mutations in the yeast SWI/SNF gene resxilt in a decrease in expression of one 
group of genes and an increase in expression of another group of genes, showing that 

20 chromatin remodeling caa have both positive aud negative effects on gene expression. 

Holsteege et al (1998) Cell 95:717-728; Sudarsanam et al (2000) Proa Natl Acad. ScL 
USA 97:3364-3369. 

Despite this knowledge of the effects of chromatin remodeling on gene expression 
in vitro and in vivo^ methods for directed manipulation of chromatin structure are not 

25 available. Accordingly, for situations in which a regulatory molecule is prevented, by 
chromatin structure, from interacting with its target site, methods for targeted 
modification of chromatin stmcture are needed. Such methods would be useful, for 
example, to facilitate bindiug of regulatory molecules to cellular chromatin and/or to 
faciUtate access of DNA-binding molecules to cellular DNA sequences. This, in turn, 

30 would facilitate regulation of gene expression, either positively or negatively, by 
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endogenous and exogenous molecules, and provide additional methods for binding these 
' molectdes to binding sites within regions of interest in cellular chromatin. 

SUMMARY 

5 Disclosed herein are compositions and methods useful for targeted modification 

of chromatin. These compositions and methods are useful for facilitating processes that 
depend upon access of cellular DNA sequences to DNA-binding molecules, for example, 
transcription, replication, recombination, repair and integration. In one embodiment, 
targeted modification of chromatin facilitates regulation of gene expression by 

10 endogenous or exogenous raolecules, by providing access to cellular DNA sequences. 
Modification is any change in chromatin structure, compared to the normal state of the 
chroraatin in the cell in which it resides. 

Accordingly, in one embodiment, a method for modifying a region of interest in 
cellular chromatin is provided, wherein the method comprises contacting cellular 

1 5 chromatin with a fusion molecule that binds to a bmding site in the region of interest. 

The fusion molecule comprises a DNA-binding domain and a component of a chromatin 
remodeling complex or a functional jBragment thereof. In a preferred embodiment, the 
fusion molecule is a polypeptide. Cellular chromatin can be present in any type of cell, 
including prokaryotic, eucaryotic or archaeal. Eucaryotic cells include microorganisms, 

20 fungal cells, plants and animals, including vertebrate, mammalian and human cells. 

In certain embodiments, the DNA-binding domain of a fusion molecule comprises 
a triplex-forming nucleic acid, an intercalator^ an antibiotic, or a minor groove binder. In 
a preferred embodiment, the DNA-binding domain comprises a zinc fmger DNA-binding 
domain. In a more preferred embodiment, a fusion molecule is a fusion polypeptide 

25 comprising a zinc fmger DNA-binding domain. Other polypeptide DNA-binding 
domains are also use&l. 

The other portion of the fusion molecule is a component of a chromatm 
remodeling complex. Numerous chromatin remodeling complexes are known to those of 
skill in the art. Chromatin remodeling complexes generally contain an enzymatic 

30 component, which is often an ATPase, a histone acetyl transferase or a histone 
deacetylase. ATPase components include, but are not limited to, the following 
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polypeptides: SWI2/SNF2, Mi-2, ISWI, BRM, BRG/BAP, Chd-l, Chd^2, Clid-3, Chd-4 
and Mot-1. Additional non-enzymatic components, iQvolved ia positioning the 
enzymatic component with respect to its substrate and/or for interaction with other 
proteiaSj, are also present in chromatin remodeling complexes and can be used as a 
5 portion of a fusion molecule. Many components of chromatin remodeling complexes 
have been identified by sequence homology. Accordingly, additional chromatin 
remodeling complexes and their components are likely to be discovered and their use is 
contemplated by the present disclosure. 

Modification of chromatin structure will facilitate many processes that require 

10 access to cellular DNA. In one embodiment, chromatin modification faciUtates 

modulation of expression of a gene of interest Modulation of expression comprises 
activation or repression of a gene of interest. In a separate embodiment, chromatin 
modification facilitates recombination between an exogenous nucleic acid and cellular 
chromatin. In this way, targeted iategration of transgenes is accomplished more 

15 efficiently. 

As noted, a fusion molecule can be a pol5/peptide. Accordingly, in one 
embodiment, chromatin modification is accomplished by contacting a cell with a 
polynucleotide encoding a fusion polypeptide, such that the polynucleotide is introduced 
into the cell and the fusion polypeptide is expressed ia the cell. In this regard, fusion 

20 polypeptides comprising a fusion between a DNA-binding domain and a component of a 
cln'omatin remodeling complex (or functional fragment thereof), as well as 
polynucleotides encodmg them, are provided. Also provided are cells comprising these 
fusion polypeptides and cells comprising polynucleotides encoding these fusion 
polypeptides. Preferred are fiision polypeptides comprising a zinc finger DNA binding 

25 domain and polynucleotides encoding them. 

In one embodiment, a region of interest in cellular chromatin, which is to be 
modified, comprises a gene. Exemplary genes whose chromatin stmcture can be 
modified through the use of the compositions and methods disclosed herein include, but 
are not Umited to, vascular endothehal growth factor (VEGF), erythropoietin (EPO), 

30 androgen receptor, PPAR-y2, pl6, p53, Kb, dystrophin and e-cadherin. Accordingly, in 
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certain embodiments, liie DNA binding domain of the fusion molecule is selected to bind 
to a sequence (z.e., a target site) in one of the aforementioned genes. 

In certain embodiments, modification of chromatin structure, using a flision 
molecule as disclosed herein, is accompanied by an additional step of contactiiig cellular 
5 chromatin with a second molecule. Often, the modification of chromatin structure 
effected by the binding of flie fusion molecule facilitates the binding of the second 
molecule. The second molecule can be a transcription regulatory molecule, either an 
endogenous factor or one that is exogenously supplied to a cell. In certain embodiments, 
tlie second molecule is also a fusion molecule, preferably a fusion polypeptide. In a 

10 preferred embodiment, the second molecule comprises a zinc finger DNA-binding 
domain. The second molecule can also comprise, for example, a transcriptional 
activation domain or a transcriptional repression domain. Thus, in one embodiment, 
modification of chromatin structure, in a region of interest, by a fusion moleciale as 
disclosed herein provides access for the binding of a second molecule which caa regulate 

15 the transcription of a gene in or near the region of interest. 

In another embodiment, a second molecule is a fusion comprising a DNA bhiding 
domain and an enzyme (or functional fragment tiiereof) that covalently modifies histones, 
for example, a histone acetyl transferase or a histone deacetylase. In this way, a first 
fusion molecule facilitates remodeling of chromatin, making it a substrate for the activity 

20 of a second fusion molecule that facilitates covalent modification of the remodeled 
chromatin. Altematively, a second molecule can comprise a fusion between a DNA 
binding domain and a component of a chromatin remodeling complex that is different 
from the one present in the first molecule. In tliis way, it is possible to recruit multiple 
chromatin remodeling complexes to a region of interest in cellular chromatin. 

25 In yet another embodiment, cellular chromatin is contacted with three molecules. 

The first comprises a fusion between a DNA binding domain and a component of a 
chromatin remodeling complex or a functional fi-agment thereof. The second molecule 
can comprise, for example, a transcriptional regulatory molecule (endogenous or 
exogenous), a fusion between a DNA binding domain and a component of a chromatin 

30 remodeling complex or a fusion between a DNA binding domain and an en2yme that 
covalently modifies histones. The third molecule can be an endogenous or exogenous 
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transcriptional regulatory molecule, or a fijsion molecule. A fusion molecule can be a 
fusion polypeptide and can comprise a DNA binding domain (e.g., a zinc finger DNA 
binding domain) and a transcriptional regulatory domain, such as, for example, an 
activation domain or a repression domain. Thus, several combinations of molecules are 
5 possible. For example, the first and second molecules can be involved in modifying 
chromatin stmcture in a region of interest to allow access to that region by a third 
molecule which can be, for example, a molecule with transcriptional regulatory function. 
Alternatively, tlie first molecule can be involved in the modification of chi*omatin 
structure to allow access by the second and third molecules (both of which can be, for 

10 example, transcriptional regulatory molecules) in a region of interest in cellular 
chromatin. In another embodiment, the first molecule can facilitate chromatin 
remodeling in the region of interest, the second molecule can be involved in covalent 
modification of histones in the region of interest, and the third molecule can bind in the 
region of interest and possess transcriptional regulatory function. In similar fashion, 

1 5 fourth, fifth, etc. molecules can also be contacted Avith cellular chromatin to modify its 
stracture in a region of interest and effect regulation of a gene in that region. 

In one embodiment, methods for modulating expression of a gene comprise the 
steps of contactmg cellular chromatin with a first fusion molecule that binds to a binding 
site in cellular chromatin, wherein the binding site is in the gene, and wherein the first 

20 fusion molecule comprises a DNA-binding domain and a component of a chromatin 

remodeling complex or a fimctional firagment thereof, and further contacting the cellular 
chromatin with a second molecule that binds to a target site in the gene and modulates 
expression of the gene. In a preferred embodiment, the DNA-binding domain of the first 
fusion molecule is a zinc finger DNA-binding domain. 

25 The second molecule can be, for example, a small molecule therapeutic, a minor 

groove binder, a peptide, a polyamide, a DNA molecule, a triplex-forming 
oligonucleotide, an RNA molecule, or a polypeptide. Exemplary polypeptides include, 
but are not limited to, transcription factors, recombinases, integrases, helicases, and DNA 
or RNA polymerases. Any of the aforementioned molecules caa be either exogenous or 

30 endogenous. Altematively, the second molecule can be a second fusion molecule, for 
example, a fusion polypeptide. In a preferred embodiment, the second molecule is a 
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fusion polypeptide comprising a zinc finger DNA binding domain. The second fusion 
molecule can also comprise a transcriptional activation domain or a transcriptional 
repression domain. 

In certain embodiments of methods for modulating expression of a gene, a 
5 plurality of first fusion molecules, each having a distinct binding site in the gene, are 
contacted with cellular chromatin. Similarly, a plurality of second molecules, each 
having a distinct target site in the gene, can be contacted with celhilar chromatin in the 
practice of methods to modulate expression of a gene. Thus, the disclosed methods for 
modulating expression of a gene can include the use of a single first fusion molecule and 

10 a single second molecule, a single first fusion molecule and a pim ality of second 
molecules, a plurahty of first fusion molecules and a single second molecule, and a 
plurality of first fusion molecules and a plurality of second molecules. 

In additional embodiments, expression of a plurality of genes is modulated 
according to the disclosed methods. This can be accomphshed in several ways. In one 

1 5 embodiment, a plurality of first fusion molecules, each binding to a distinct binding site, 
wherein each distinct blading site is in a distinct gene, are contacted with cellular 
chromatin. One or more of the first fusion molecules can be a zinc finger fiision 
pol3/peptide comprising a zinc-finger DNA-binding domain. In certain embodiments, a 
first fiision molecule can bind to a shared binding site in two or more of the plurality of 

20 genes. In one embodiment of a method for modulating the expression of a plurality of 

genes, a single first fusion molecule binds to a shared binding site m all of the plurality of 
genes whose expression is modulated. 

Additional methods for modulating the expression of a plurality of genes involve 
contacting a plurality of second molecules with cellular chromatin, in combination with 

25 the contact of one or more first fusion molecules with cellular chromatin. Each of the 
plurality of second molecules can bind to a distinct target site, wherein each distinct 
target site is in a disthict gene. Alternatively, a single second molecule can bind to a 
shared target site in two or more different genes. In one embodiment, a sirfgle second 
molecule binds to a shared target site in all of the plurality of genes whose expression is 

30 modulated. 
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In certain embodimentSp to facilitate the binding of the fusion molecule to tlie 
cellular chromatin^ one or more accessible regions within the region of interest are 
identified and one or more target sites for the DNA-binding portion of the fusion 
molecule are identified within the accessible region. In separate embodimentSj, the DNA- 
5 binding domain is capable of binding to nucleosomal DNA sequences and identification 
of an accessible region is not necessary. In the latter case, chromatin modification, as 
disclosed herein, often results in the generation of an accessible region in cellular 
chromatin in the region of interest, which can facilitate the binding of other molecules, 
either exogenous or endogenous. Exogenous molecules whose binding can be facilitated 

10 by the generation of an accessible region through chromatin modification include, but are 
not limited to, minor groove binders, major groove binders, intercalators, small molecule 
therapeutics, nucleic acids, and polypeptides, including fiision polypeptides, preferably 
comprising a zinc finger DNA-binding domain. 

Polynucleotides encoding fiisions between a DNA-binding domain and a 

1 5 component of a chromatin remodeling complex, and methods for their construction, are 
also provided. 

Fusion polypeptides, comprising a DNA-bindiag domain and a component of a 
chromathi remodeling complex, and methods for producing such fiision potypeptides, are 
also provided. In one embodiment, such fiision polypeptides are produced by expressing 
20 a polynucleotide as described in the preceding paragraph in a suitable host cell. 

Methods for binding an exogenous molecule to cellular chromatin, wherein the 
methods comprise targeted modification of chromatin structiure as disclosed herein, are 
also provided. 



25 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the PGR amplification scheme for production of constructs 
encoding the Vegl and Veg3a DNA-binding domains. 

Figure 2 shows the results of DNA-bmding affinity determination for the Vegl 
DNA-bmding subdomam. Figure 2A shows EMSA analysis. Unbound probe is at the 
30 bottom of the gel and shifted probe (bound to Vegl) is indicated by the arrow to the right 
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binding reaction in which 15 nM of the Vegl -maltose binding protein fusion was used. 
Figure 2B is a graph depictiag Kd. 

Figure 3 shows an autoradiogram of a DNA gel, indicating the existence and 
location of DNasel-hypersensitive sites in the hnniati VEGF-A gene. 
5 Figure 4 is a schematic diagram of the plasmid pSRClb-EP02C. The rightward- 

pointing arrow represents the start site of a transcription unit encoding a fusion protein 
that includes a nuclear localization signal (NLS), a ZFP binding domain targeted to 
nucleotide —862 of the human erythropoietin gene (EP02c ZFP), a portion of the SRCl 
protein from amino acids 781-1385 (SRClb), and a FLAG epitope (Flag). pCMV 
10 represents a CMV promoter. Selected restriction enzyme recognition sites are also 
indicated. 

Figure 5 shows erythropoietin (EPO) levels in transfected and control cells, as 
determined by ELIS A. The bar labeled pSRClb-EP02C represents levels of EPO 
secreted into the medium by cells transfected with a plasmid encoding a fiision between 
15 an EPO-targeted ZFP and a portion of the SRC 1 protein. pCDNA3 . 1 represents secreted 
EPO levels in cells transfected with a control plasmid that does not encode a ZFP-SRCl 
flision. 

Figure 6 is a schematic diagram depicting the structure of a set of fusion 

molecules described in Example 13. NLS refers to a nuclear localization sequence, ZFP- 
20 DBD refers to the VEGF3a/l zinc finger DNA-binding domain, MBD refers to a portion 

of a methyl binding domain protein, DNMT refers to a portion of a DNA N-mefhyl 

transferase protein, and Flag refers to a FLAG epitope. 

Figure 7 shows VEGF levels in transfected and control cells, as determined by 

ELISA. MOCK refers to cells transfected with a vector that does not contain a ZFP- 
25 MBD or ZFP-DNMT fusion. pEGFP-KRAB refers to cells transfected with a gi^een 

fluorescent protein-encoding plasmid. PVF3a/l refers to the VEGF3a/l DNA binding 

domain described in Examples 3 and 13, MBD refers to various methyl binding domain 

proteins. DNMT refers to various DNA N-methyl transferases. 
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DETAILED DESCRIPTION 
Disclosed herein are compositions and methods useful for modifying chromatin 
structure in a predetermined region of interest in cellular chromatin. Modification of 
chromatin structure facilitates many processes involving nucleotide sequence-specific 
5 interaction of molecules with cellular chromatin. In certain embodiments, modification 
of chromatin structure is a prerequisite for binding of a regulatory molecule to its target 
site in cellular chromatin. Such binding can be useful in the regulation of an endogenous 
cellular gene by one or more endogenous and/or exogenous molecules. 

Regulation of gene expression often involves recmitment of a chi-oniatin 

10 remodeling complex to a region of cellular chromatin (e.g. , the promoter of a gene). 

Recruitment can occur, for example, by protein-protein interactions between a sequence- 
specific DNA-binding transcriptional regulatory protein bound at a promoter and a 
component of the remodeling complex. See, for example, Peterson et al (2000) Ciirr, 
Opin, Genet. Devel 10;187-192. Alterations in chromatin structure in the vicinity of the 

1 5 promoter, mediated by the recruited remodeling complex, faciUtate subsequent 

interactions that result in transcriptional activation or repression. However, the region to 
which a remodeling complex can be localized is hmited by the sequence specificity of the 
DNA-biuding transcriptional regulatory protein, since most, if not all, protein 
components of chromatin remodeling complexes do not possess sequence-specific DNA- 

20 binding activity. Thus, it is not easy to target chromatin remodeling to a particulai" region 
of interest in cellular chromatin unless one possesses a protein that is: (1) capable of 
binding to chromatin in or near the region of interest, and (2) capable of interacting with 
at least one component of a multi-subimit chromatin remodeling complex. 

The methods and compositions disclosed herein allow targeted modification of 

25 any region of interest in cellular chromatin, by employing a fusion molecule comprising a 
DNA-binding domain and a component of a chromatin remodehng complex or functional 
fragment thereof. The DNA-bindiag domain is selected or designed to bind to a target 
site within or near tlie region of interest. Any DNA-binding entity having the requisite 
specificity is suitable. In a preferred embodiment, the DNA-binding domain is a zinc 

3 0 finger DNA-binding domain- Binding of the DNA-binding portion of the fusion 
molecule localizes the portion of the fusion molecule comprising a component of a 
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chromatm remodeliag complex to the region of binding, where it interacts with other 
components to reconstitute a functional chromatin remodeling complex in the vicinity of 
the target site. Chromatin remodeUng ensues in the vicinity of the target site, which 
renders the region of binding {e.g., a gene promoter) susceptible to the action of 
5 endogenous regulatory factors, and/or to the regulatory activities of exogenous 
molecules. 

It will be apparent to one of skill in the art tliat targeted remodeling of chromatin 
will facilitate the regulation of many processes involving access of molecules to DNA in 
cellular cliromatin including, but not limited to, replication, recombination, repair, 
10 transcription, telomere function and maintenance, sister chromatid cohesion, and mitotic 
chromosome segregation. For example, targeted integration of exogenous DNA into 
cellular chromatin will be enhanced by chromatin remodeling in the region of the desired 
integration site. 



15 General 

The practice of the disclosure employs, unless otherwise indicated, conventional 
techniques in molecular biology, biochemistry, chromatin structure and analysis, 
computational chemistry, cell culture, recombinant DNA and related fields as are within 
the skill of the art. These techniques are fully explained in the literature. See, for 

20 example, Sambrook et al MOLECULAR CLONING: A LABORATORY MANUAL, Second 
edition. Cold Spring Harbor Laboratory Press, 1989; Ausubel et al, CURRENT 
PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York, 1987 and periodic 
updates; the series METHODS IN ENZYMOLOGY, Academic Press, San Diego; Wolffe, 
CHROMATIN STRUCTURE AND FUNCTION, Third edition, Academic Press, San Diego, 

25 1998; METHODS IN ENZYMOLOGY, VoL 304, "Chromatin" (P.M. Wassarman and A. P. 
Wolffe, eds.). Academic Press, San Diego, 1999; and METHODS IN MOLECULAR 
BIOLOGY, VoL 119, "Chromatin Protocols" (P.B. Becker, ed.) Humana Press, Totowa, 
1999. 
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Definitions 

Chromatin is the micleoprotein structure comprising the cellular genome. 
Cellular chromatin comprises nucleic acid, primarily DNA, and protein, including 
histones and non-histone chromosomal proteins. The majority of eukaryotic cellular 
5 chromatin exists in the form of nucleosomes, wherein a nucleosome core comprises 

approximately 150 base pairs of DNA associated with an octamer comprising two each of 
histones H2A, H2B, H3 and H4; and Imker DNA (of variable length depending on the 
organism) extends between nucleosome cores. A molecule of histone HI is generally 
associated with the linl<:er DNA. For the purposes of the present disclosure, the term 

10 "chromatin" is meant to encompass aU types of cellular nucleoprotein, both prokaryotic 
and eukaryotic. Cellular chromatin includes both chromosomal and episomal chromatin. 

Chromatin modification, or chromatin remodeling, refers to any process by which 
the structure of chromatin or its constituents is altered. Remodeling can include, for 
example, removal or repositioning of nucleosomes, addition of nucleosomes, changes in 

15 nucleosome density, changes in the path of DNA along the histone octamer, and/or 
changes in higher-order chromatia structure such as, for example, unwinding of the 
chromatm solenoid Chromatin modification can also include modifications to histones 
or nucleic acid which might not necessarily change the stmcture of chromatia as 
assayable by current methods. For example, acetylation or deacetylation of histones , as 

20 well as methylation or demethylation of nucleic acid, are instances of chromatin 
modification. 

A chromosome, as is known to one of skill in the art, is a chromatin complex 
comprising all or a portion of the genome of a cell. The genome of a cell is often 
characterized by its karyotype, which is the collection of all the chromosomes that 
25 comprise the genome of the cell. The genome of a cell can comprise one or more 
chromosomes. 

An episome is a replicating nucleic acid, nucleoprotein complex or other structure 
comprising a nucleic acid that is not part of the chromosomal karyotype of a cell. 
Examples of episomes include plasmids and certain viral genomes. 
30 A target site is a nucleic acid sequence that defines a portion of a nucleic acid to 

which a binding molecule will bind, provided sufficient conditions for binding exist. For 
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example, the sequence 5'-GAATTC-3' is a target site for the Eco RI restriction 
endonuclease. Although binding of a molecule to its target site will generally occur in a 
naked nucleic acid molecule, a binding molecule may be incapable of binding to its target 
site in cellular chromatin, as a result of some aspect of the structure of the chromatin in 
5 which tlie target site is located which makes the target site inaccessible to the binding 
molecule. In other cases, factors in addition to a target site may be required for binding 
of a molecule to a nucleic acid at the target site. For instance, binding of a molecule to a 
polynucleotide comprising a target site may require both a particular nucleotide sequence 
and a particular protein composition adjacent to, or in the vicinity of, the target site. 

10 Conditions such as, for example, temperature, pH, and ionic strength can also affect 
binding of a molecule to its target site. 

Target sites for various transcription factors are known. See^ for example, 
Wingender et ah (1997) Nucleic Acids Res. 25:265-268 and the TRANSFAC 
Transcription Factor database at http://transfac>gb£de/TRANSFAC/- accessed on April 

15 13, 2000. In general, target sites for newly-discovered transcription factors, as well as 

other types of exogenous molecule, can be determined by methods that are well-known to 
those of skill in the art such as, for example, electrophoretic mobility shift assay, 
exonuclease protection, DNase footpriuting, chemical footprinting and/or durect 
nucleotide sequence determination of a binding site. See, for example, Ausubel et aL, 

20 supra. Chapter 12. 

A binding site in cellular cluromatin is a region at which a particular molecule, for 
example a protein, will bind to a tai-get site in the chromatin. A binding site will 
generally comprise a target site, but not every target site will constitute a binding site in 
cellular chromatin. For example, a target site may be occluded by one or more 

25 chromosomal components, such as histones or nonliistone proteins, or might be rendered 
inaccessible to its binding molecule because of nucleosomal or higher-order chromatin 
structure. On the other hand, the presence of one or more chromosomal proteins may be 
required, in addition to a target site, to defibae a binding site. 

An accessible region is a site in a chromosome, episome or other cellular structure 

30 comprising a nucleic acid, in which a target site present in the nucleic acid can be bound 
by an exogenous moleciile which recognizes the target site. Without wishing to be bound 
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by any particular theory, it is believed that an accessible region is one that is not 
packaged into a nucleosomal stracture. The distinct structure of an accessible region can 
often be detected by its sensitivity to chemical and enzymatic probes, for example, 
nucleases. 

5 An exogenous molecule is a molecule that is not normally present in a cell, but 

can be introduced into a cell by one or more genetic, biochemical or other methods. 
'"Normal presence in the cell" is determined with respect to the particular developmental 
stage and environmental conditions of the cell Thus, for example, a molecule that is 
present only during embryonic development of muscle is an exogenous molecule with 

10 respect to an adult muscle cell. Similarly, a molecule induced by heat shock is an 

exogenous molecule with respect to a non-heat-shocked cell. An exogenous molecule 
can comprise, for example, a functioning version of a malfunctioning endogenous 
molecule or a malfunctioning version of a normally-fimctiorring endogenous molecule. 
An exogenous molecule can be, among other things, a small molecule, such as is 

1 5 generated by a combinatorial chemistry process, or a macromolecule such as a protein, 
nucleic acid, carbohydrate, lipid, glycoprotein, lipoprotien, polysaccharide, any modified 
derivative of the above molecules, or any complex comprising one or more of the above 
molecules. Nucleic acids include DNA aad RNA, can be single- or double-stranded; can 
be hnear, branched or circular; and can be of any length. Nucleic acids include those 

20 capable of forming duplexes, as well as triplex-forming nucleic acids. See, for example, 
U.S. Patent Nos. 5,176,996 and 5,422,251. Proteins include, but are not limited to, DNA- 
binding proteins, transcription factors, chromatin remodeling factors, methylated DNA 
binding proteins, polymerases, methylases, demethylases, acetylases, deacetylases, 
kinases, phosphatases, integrases, recombinases, Hgases, topoisomerases, gyrases and 

25 helicases. 

An exogenous molecule can be the same type of molecule as an endogenoiis 
molecule, e,g., protein or nucleic acid, providing it has a sequence that is different from 
an endogenous molecule. For example, an exogenous nucleic acid can comprise an 
infecting viral genome, a plasmid or episome introduced into a cell, or a chromosome that 
30 is not normally present in the cell. Methods for the introduction of exogenous molecules 
into cells are known to those of skill in the art and include, but are not limited to, lipid- 
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mediated transfer {te., liposomes, including neutral and cationic lipids), electroporation, 
direct injection, cell flision, particle bombardment, calcium phosphate co-precipitation, 
DEAE-dextran-mediated transfer and viral vector-mediated transfer- 
By contrast, an endogenous molecule is one that is normally present in a 
5 particular cell at a particular developmental stage under particular environmental 

conditions. For example, an endogenous nucleic acid can comprise a chromosome, the 
genome of a mitochondrion, chloroplast or other organelle, or a naturally-occiming 
episomal nucleic acid. Additional endogenous molecules can include proteins, for 
example, transcription factors and components of chromatin remodeling complexes. 

10 A fusion molecule is a molecule in which two or more subunit molecules are 

linked, preferably covalently. The subunit molecules can be the same chemical type of 
molecule, or can be different chemical types of molecules. Examples of the first type of 
fiision moleciile include, but are not limited to, fusion polypeptides (for example, a fusion 
between a ZFP DNA-binding domain and a transcriptional activation domain) and fusion 

15 nucleic acids (for example, a nucleic acid encoding the fusion polypeptide described 

supra). Examples of the second type of fusion molecule include, but are not limited to, a 
fusion between a triplex-forming nucleic acid and a polypeptide, and a fusion between a 
minor groove binder and a nucleic acid. In a preferred embodiment, a fusion molecule is 
a nucleic acid which encodes a ZFP DNA-binding domain in operative linkage with a 

20 component of a chromatm remodeling complex or functional fragment thereof. 

A gene, for the purposes of the present disclosure, includes a DNA region 
encoding a gene product (see infra), as well as all DNA regions which regulate the 
production of the gene product, whether or not sixch regulatory sequences are adjacent to 
coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily 

25 limited to, promoter sequences, terminators, translational regulatory sequences such as 
ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, 
boundary elements, replication origins, matrix attachment sites and locus control regions. 

Gene expression refers to the conversion of the information, contained in a gene, 
into a gene product. A gene product can be the direct transcriptional product of a gene 

30 {e.g.^ mRNA, tRNA, rRNA, antisense RNA, ribozyme, stmctural RNA or any other type 
of RNA) or a proteia produced by translation of a mRNA. Gene products also include 

17 
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RNAs which are modified, by processes such as capping, polyadenylation, methylation, 
and editing, and proteins modified by, for example, methylation, acetylation, 
phosphorylatiotx, ubiquitioation, ADP-iibosylation, myristilation, and glycosylation. 
Modulation of gene expression refers to a change in the activity of a gene. 
5 Modulation of expression can include, but is not limited to, gene activation and gene 
repression. 

Gene activation is any process which results in an increase in production of a gene 
product, A gene product can be either RNA (including, but not limited to, mRNA, 
rRNA, tRNA, and structural RNA) or protein. Accordingly, gene activation includes 

10 those processes which increase transcription of a gene and/or translation of a mRNA. 
Examples of gene activation processes which increase transcription include, but are not 
limited to, those which facilitate fomiation of a transcription initiation complex, those 
which increase transcription initiation rate, those which increase transcription elongation 
rate, those which increase processivity of transcription and those which relieve 

1 5 transcriptional repression (by, for example, blocking the binding of a transcriptional 

repressor). Gene activation can constitute, for example, inhibition of repression as well 
as stimulation of expression above an existing level. Examples of gene activation 
processes which increase translation include those which increase translational initiation, 
those which increase translational elongation and those which increase mRNA stability, 

20 In general, gene activation comprises any detectable increase in the production of a gene 
product, preferably an increase in production of a gene product by about 2~fold, more 
preferably firom about 2- to about 5 -fold or any integer therebetween, more preferably 
between about 5- and about 10-fold or any integer therebetween, more preferably 
between about 10- and about lO-fold or any integer therebetween, still more preferably 

25 between about 20- and about 50-fold or any integer therebetween, more preferably 

between about 50- and about 100-fold or any integer therebetween, more preferably 100- 
fold or more, 

G ene repression is any process which results in a decrease in production of a gene 
product. A gene product can be either RNA (including, but not limited to, mRNA, 
30 rRNA, tRNA, and structural RNA) or protein. Accordingly, gene repression includes 
those processes which decrease transcription of a gene and/or translation of a mRNA, 

18 
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Examples of gene repression processes which decrease transcription include, but are not 
limited to, those which inhibit formation of a transcription initiation complex, those 
which decrease transcription initiation rate, those which decrease transcription elongation 
rate, those which decrease processivity of transcription and those which antagonize 
5 transcriptional activation (by, for example, blocking the binding of a transcriptional 

activator). Gene repression can constitute, for example, prevention of activation as well 
as iiihibition of expression below an existing level. Examples of gene repression 
processes which decrease ti'anslation include those which decrease translational initiation, 
those which decrease translational elongation and those which decrease mRNA stability. 

10 Transcriptional repression includes both reversible and irreversible inactivation of gene 
transcription. In general, gene repression comprises any detectable decrease in the 
production of a gene product, preferably a decrease in production of a gene product by 
about 2-fold, more preferably from about 2- to about 5-fold or any integer therebetween, 
more preferably between about 5- and about 10-fold or any integer therebetween, more 

15 preferably between about 10- and about 20-fold or any integer therebetween, still more 
preferably between about 20- and about 50-fold or any integer therebetween, more 
preferably between about 50- and about 100-fold or any integer therebetween, more 
preferably 100-fold or more. Most preferably, gene repression results in complete 
inhibition of gene expression, such that no gene product is detectable. 

20 Eucaryotic cells include, but are not limited to, fimgal cells (such as yeast), plant 

cells, animal cells, mammalian cells and human cells. 

A region of interest is any region of cellular chromatin, such as, for example, a 
gene or a non-coding sequence within or adjacent to a gene, in which it is desirable to , 
for example, modify chromatin structure and/or bind an exogenous molecule. A region 

25 of interest can be present in a chromosome, an episome, an organellar genome (e.g., 
mitochondrial, chloroplast), or an infecting viral genome, for example. A region of 
interest can be within the coding region of a gene, within transcribed non-coding regions 
such as, for example, leader sequences, trailer sequences or introns, or within non- 
transcribed regions, either upstream or downstream of the coding region, 

30 The terms "operative linkage" and "operatively Unked" are used with reference to 

a juxtaposition of two or more components (such as sequence elements), in which the 
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components are arranged such that both components function normally and allow Hie 
possibility that at least one of the components can mediate a function that is exerted upon 
at least one of the other components. By way of illx^tration, a transcriptional regulatory 
sequence, such as a promoter, is operatively linked to a coding sequence if the 
5 transcriptional regulatory sequence controls the level of transcription of the coding 

sequence in response to the presence or absence of one or more transcriptional regulatory 
factors. An operatively linked transcriptional regulatory sequence is generally joined in 
. CIS witli a coding sequence, but need not be directly adjacent to it. For example, an 
enhancer can constitute a transcriptional regulatory sequence tliat is operatively-liiiked to 

10 a coding sequence, even though they are not contiguous. 

With respect to fusion polypeptides, the term "operatively hnked" can refer to the 
fact that each of the components performs the same function in linkage to the other 
component as it would if it were not so linked. For example, with respect to a fusion 
polypeptide in which a ZFP DNA-binding domain is fused to a component of a chromatin 

15 remodeling complex (or functional fragment thereof), the ZFP DNA-binding domain and 
the component of the chromatin remodeling complex (or functional firagment thereof) are 
in operative linkage if, in flie fiision polypeptide, the ZFP DNA-binding domain portion 
is able to bind its target site and/or its binding site, while the component of the chromatin 
remodeling complex (or functional fragment thereof) is able to interact with other 

20 members of its cognate chromatin remodeling complex. 

A functional fragment of a protein, polypeptide or nucleic acid is a protein, 
polypeptide or nucleic acid whose sequence is not identical to the full-length protein, 
polypeptide or nucleic acid, yet retains the same function as the full-length protein, 
polypeptide or nucleic acid. A functional fragment can possess more, fewer, or the same 

25 number of residues as the corresponding native molecule, and/or can contain one ore 

more amino acid or nucleotide substitutions. Methods for determining the function of a 
nucleic acid {e.g., coding function, ability to hybridize to another nucleic acid) are well- 
known in the art. Similarly, methods for determining protein function are well-known. 
For example, the DNA-binding function of a polypeptide can be determined, for 

30 example, by filter-binding, electrophoretic mobility-shift, or immunoprecipitation assays. 
See Ausubel et ah, supra. The ability of a protein to interact with another protein can be 
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determined, for example, by co-imm.iinoprecipitation, two-hybrid assays or 
complementation, both genetic and biochemical. See, for example. Fields et al (1989) 
Nature 340:245-246; U.S. Patent No. 5,585,245 and PCT WO 98/44350. 

5 DNA-Binding domains 

In certain embodiments, the compositions and methods disclosed herein involve 
fusions betv^^een a DNA-binding domain and a component of a chromatin remodeling 
complex. In additional embodiments, the compositions and methods disclosed herein 
involve fusions between a DNA-binding domain and a domain which participates in 

10 modulation of gene expression such as, for example a transcriptional activation domain 
or a transcriptional repression domain. A DNA-binding domain can comprise any 
molecular entity capable of sequence-specific binding to chromosomal DNA. Binding 
can be mediated by electrostatic interactions, hydrophobic interactions, or any other type 
of chemical interaction. Examples of moieties which can comprise part of a DNA- 

15 binding domain include, but are not limited to, minor groove binders, major groove 
binders, antibiotics, intercalating agents, peptides, polypeptides, oligonucleotides, and 
nucleic acids. An example of a DNA-binding nucleic acid is a triplex-forming 
oligonucleotide. 

Minor groove binders include substances which, by virtue of their steric and/or 
20 electrostatic properties, interact preferentially with the minor groove of double-stranded 
nucleic acids. Certain minor gioove binders exhibit a preference for particular sequence 
compositions. For instance, netropsin, distamycin and CC-1065 are examples of minor 
groove binders which bind specifically to AT-rich sequences, particularly runs of A or T. 
WO 96/32496. 

25 Many antibiotics are known to exert their effects by binding to DNA. Binding of 

antibiotics to DNA is often sequence-specific or exhibits sequence preferences. 

Actinomycin, for instance, is a relatively GC-specific DNA binding agent. 

In a preferred embodiment, a DNA-binding domain is a polypeptide. Certain 

peptide and polypeptide sequences bind to double-stranded DNA in a sequence-specific 
30 manner. For example, transcription factors participate in transcription initiation by RNA 

Polymerase II through sequence-specific interactions with DNA in the promoter and/or 
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enhancer regions of genes. Defined regions within the polypeptide sequence of various 
transcription factors have been shown to be responsible for sequence-specific binding to 
DNA, See, for example, Pabo et al (1992) Ann, Rev, Biochem. 61:1053-1095 and 
references cited therein. These regions include, but are not limited to, motifs known as 
5 leucine zippers, helix-loop-helix (HLH) domains, helix-tum-helix domains, zinc fingers, 
(i-sheet motifs, steroid receptor motifs, bZIP domains homeodomains, AT-hooks and 
others. The amino acid sequences of these motifs are known and, in some cases, amino 
acids that are critical for seqixence specificity have been identified. Polypeptides 
involved in other process involving DNA, such as replication, recombination and repair, 
10 will also have regions involved in specific interactions with DNA. Peptide sequences 

involved in specific DNA recognition, such as those found in transcription factors, can be 
obtained through recombinant DNA cloning and expression techniques or by chemical 
synthesis, and can be attached to other components of a fiision molecule by methods 
known in the art. 

15 Proteins containing methyl binding domains, or fimctional fragments thereof, can 

also be used as DNA-binding domains. Methyl binding domain proteins recognize and 
bind to CpG dinucleotide sequences in which the C residue is methylated. Proteins 
containing a methyl-binding domain include, but are not limited to, MBDl, MBD2, 
MBD3, MBD4, MeCPl and MeCP2. See, for example. Bird et al (1999) Cell 99:451- 

20 454. 

Additionally, DNA methyl transferases, which methylate the 5-position of C 

residues in CpG dinucleotides such as, for example, DNMTl, DNMT2, DNMT3a and 

DNMTSb, or functional fiagments thereof, can be used as a DNA-binding domain. 

Furthermore, enzymes which demethylate methylated CpG , or functional fi-agments 
25 thereof, can be used as a DNA-binding domain. Fremant et al (1997) Nucleic Acids Res, 

25:2375-2380; Okmio et al (199S) Nature Genet 19:219''220; Bhattacharya a/. 

(1999) Nature 397:579-583; and Robertson et al (2000) Carcinogenesis 21:461-467. 

In a more preferred embodiment, a DNA-binding domain comprises a zinc finger 

DNA-bmding domain. See, for example, Miller et al (1985) EMBO J, 4:1609-1614; 
30 Rhodes et al (1993) Scientific American Feb.:56-65; and Klug (1999) J, Mol Biol 

293:21 5-21 8. M one embodiment, a target site for a zinc finger DNA-binding domain is 
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identified according to site selection rules disclosed in co-owned WO 00/42219. ZFP 
DNA-binding domains are designed and/or selected to recognize a particular target site as 
described in co-owned WO 00/42219; WO 00/41566; and U.S. Serial Nos. 09/444,241 
filed November 19, 1999 and 09/535,088 filed March 23, 2000; as well as U.S. Patents 
5 5,789,538; 6,007,408; and 6,013,453; and PCT pubUcations WO 95/19431, 
WO 98/54311, WO 00/23464 and WO 00/27878. 

Certain DNA-binding domains are capable of binding to DNA that is packaged in 
nucleosomes. for example, Cordingleyef a/. (1987) Ce// 48:261-270; Pina et aL 
(1990) Ce// 60:719-731; and Cirillo a/. (1998) £:A//50 J, 17:244-254. Certain ZFP- 

10 containing proteins such as, for example, members of the nuclear hormone receptor 
superfamily, are capable of binding DNA sequences packaged into chromatin. These 
include, but are not Umited to, the glucocorticoid receptor and the thyroid hormone 
receptor. Archer et al (1992) Science 255: 1573-1 576; Wong et al. (1997) EMBO J. 
16:7130-7145, Other DNA-binding domains, including certain ZFP-containing binding 

15 domains, require more accessible DNA for binding* In the latter case, the binding 
specificity of the DNA-binding domain can be determined by identifying accessible 
regions in the cellular chromatin. Accessible regions can be determined as described in 
co-owned U.S. Patent Apphcation entitled 'Databases of Accessible Region Sequences; 
Methods of Preparation and Use Thereof," reference SI 5, filed even date herewith. A 

20 DNA-binding domain is then designed and/or selected to bind to a target site within the 
accessible region. 

Chromatin remodeling complexes 

Two major types of chromatin modification have been described. The first is 
25 dependent on covalent modification. Covalent modification of histones occurs by 

processes such as, for example, acetylation and deacetylation. Covalent modification of 
DNA is exemplified by methylation of cytosine residues in CpG dinucleotides. The 
second type of modification results in changes in nucleosome location and/or 
conformation, and relies on the activity of ATP-driven chromatin remodeling machines. 
30 Both types of chromatin modification are carried out in vivo by multiprotein complexes. 
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For the purposes of the present disclosure, proteins involved in either of these types of 
chromatin modification can comprise a component of a chromatin remodelmg complex. 

Modifications of the first type ofl:en comprise histone acetylation, catalyzed by a 
complex containing a histone acetyl transferase (EIAT), or histone deacetylation, 
5 catalyzed by a complex contahrtng a histone deacetylase (HDAC). An example of a 
complex involved in this type of chromatin modification is a histone deacetylase 
complex, examples of which include the SINS and Mi-2 complexes. Knoepfler et al 
(1999) Cell 99:447-450, These complexes generally comprise one or more enzymatic 
components {te., a HDAC) as well as one or more non-enzymatic components. Thus, a 

10 component of a chromatin remodeling complex can be either an enzymatic or a non- 
enzymatic component (or a functional fragment of an enzymatic or non-enzymatic 
component) of a complex involved in the covalent modification of histones. 

The second type of chromatin modification is mediated by multiprotein chromatin 
remodeling complexes, exhibits nucleosome-, histone- and/or DNA-dependent ATPase 

15 activity and catalyzes various types of modification of chromatin structure (see infra). 

Grenerallyj a remodeUng complex comprises an enzjrmatic component (an ATPase protein 
subtinit) and one or more non-enzymatic protein subunits. ATPase subnnits are grouped 
into three major families: the SWI/SNF family, the ISWI family, and the Mi-2/CHD 
family. See Tyler et al (1999) Cell 99:443-446. A component of a chromatin 

20 remodeling complex can comprise one of its constituent proteins or a flmctional firagment 
thereof. Thus, a component of a chromatin remodeling complex can be an enzymatic 
component or a non-enzymatic component. 

Enzymatic components of chromatin remodeling complexes include, but are not 
limited to, the following ATPases: SWI2/SNP2, STHl, BRM, hBRM, BRGl, Mi- 

25 2/CHD, ISWI, ISW2, ISWI, and hSNF2h. Tyler et al, supra; Armstrong et al. (1998) 
Curr. Opin. Genet, Dev. 8:165-172; Guschm etal (1999) Curr. Biol 9:R742-746; and 
Wolffe et al (2000) J. Struct Biol 129:102-122. 

Modifications in chromatin stmcture include those which render chromosomal 
sequences more accessible to regulatory factors {le,, formation of "open" chromatin) as 

30 well as those which make chromosomal sequences less accessible (i,e., formation of 
"closed" chromatin). Such modifications can include, for example, removal of 
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nucleosomes from DNA, deposition of nucleosomes onto DNA, repositioning of 
nucleosomes, changes in nucleosome spacing, changes in nucleosome density^ changes in 
the degree and/or nature of the interaction between DNA and liistones in the nucleosome, 
changes in the path of DNA along the surface of the nucleosome^^ and/or changes in 
5 higher-order chromatin structure such as, for example, imwinding of the chromatin 
solenoid. 

In certain embodiments, the compositions and methods disclosed herein involve 
fusions between a DNA-binding domain and a component of a chromatin remodeling 
complex^ as described supra^ or a polynucleotide encoding such a fusion. 

10 Various chromatin remodeling complexes, then components and their activities 

have been identified and characterized in several organisms and cell types. Complexes 
Icnown as SWT/SNF, RSC, IS Wl and ISW2 have been isolated and characterized in yeast 
JnDrosophila, the NURF, CHRAC, ACF and brahma (dSWI/SNF or BRM) complexes 
have been isolated and characterized- Cliromatin remodeling complexes from human 

1 5 cells named brm/BRG (hSWI/SNF), NURD and RSF have been isolated and 
characterized. See, for example. Calms (1998) Trends Biochem, ScL 23:20-25; 
Murchardt et al (1999) J. Mol Biol. 293:185-197; Kingston et al (1999) Genes, Devel 
13:2339-2352 and their cited references. It is likely that, as the field progresses, 
additional chromatin remodeling complexes, and their components, will be discovered 

20 and characterized; the use of such newly-discovered components of chromatra 

remodehng complexes is contemplated by the present disclosure. Exemplary chromatin 
remodeling complexes and their components are now described. 
A. SWI/SNF 

The SWI/SNF chromatin remodeling complex of yeast comprises the SWI2/SNF2 
25 helicase/ATPase and products of the SNF5, SWI3, SWP73, ARP7, ARP9, SWIl, SNF6, 
SWP82, SWP29 and SNFl 1 genes. Arp7 and Arp9 are actin-related proteins. (The 
SWP29 gene product is also known as either TFG-3 or TAF30.) Peterson et al (2000) 
Curr. Opin. Genet DeveL 10:187-192. 
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B. SWI/SISIF homologues 

Several chromatin remodeling complexes, have been isolated based on their 
possession of a subnnit with homology to SWI2, These include the Brahma (BRM) 
complex hi Drosophila, the brm/BRG complexes in mammals, and others. 
5 1. Brahma 

The Drosophila brahma {brm) complex (also known as dS WI/SNF) 
contains an ATPase subunit, homologous to SWI2/SNF2, called brahma {brm\ as well as 
SNRl, BAP155 (moira), BAP60, BAPl 1 1, EAP55, BAP74 and BAP47/ACT1/ACT2 
subnnits. 

10 2. Mammalian brm/BRG complexes 

In humans and mouse, several complexes comprising one of two 
SWI2/SNF2 -homologous ATPases have been characterized. In mice, chromatin 
remodeling complexes contain either of the two SWI2/SNF2 homologues mBRM or 
mBRG-1, along with subimits named mSNFS and mBAF60a. Similarly, in himiaa cells, 

15 either of the two SWI2/SNF2 homologues hBRM (also known as hS]S[F2a) or BRG-1 

(also known as hSNF2p) are present in chromatin remodeling complexes also containing 
the hSNF5 (also known as INI-1), hBAF170, hBAF155, hBAF60a (or hBAF60b or 
hBAF60c), hBAF57, p-actin, hBAF53, hBAF250 (also known as p270) and liBAFl 10 
subunits. 

20 3. Chromatin remodeling complexes active in the regulation of human 

globin genes 

Several chromatin remodeling complexes have been discovered by virtue 
of their participation in the regulation of globin gene expression in human cells. These 
include E-RCl, comprising BRG-1 and the BAF57 protein, and the PYR complex, 
25 comprising hSNF5/INIl, BAF57, BAF60a, and BAFl 70. 
4. RSC 

The RSC complex ("remodels the structure of chromatin"), first identified 
in yeast, is a I5-subunit complex comprising the SWI2/S1S1F2 homologous ATPase 
STHl, along with SFH-1, RSC-8, actin-related proteins, RSC-6 and SAS-S. Two 
30 recently characterized subunits of RSC, denoted Rscl and Rsc2, each contains two 

bromodomains, a BAH ('"bromo adjacent homology") domain and an A/T hook motif. 
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and thus likely participates m the interaction between the RSC complex and chromatin. 
Cairns e^ a/. (1999) M?/. C^// 4:715-723. 

5. ATRX and related prpteins 
5 A family of helicase/ATPase proteins with homology toSNF2 have been 

described. These proteins contain seven conserved domains and are involved in a range 
of cellular functions, including transcription, recombination and repair. The mammalian 
ATRX protein is an example of this group of proteins. See Picketts et aL (1996) Hum. 
MoL Genet. 5:1899-1907. 
10 C. ISWI-containing complexes 

Several chromatin remodeling machines, initially characterized in Drosophila 
cells; contain an ATPase subunit with homology to yeast SWI2, known as ISWI 
("imitation SWT'). 

1> NURF 

1 5 NURF (Kucleosome Remodeling Factor) is a complex of four 

polypeptides, isolated from Drosophila^ that is capable of ATP-dependent jremodeling of 
chromatin. Remodeling by NURF is Sarkosyl-sensitive and nucleosome-dependent (in 
particular, is dependent on histone tails), and can facilitate binding of transcription factors 
to chromatin. Tsukiyama et al (1995) Cell 83:101 1-1020. The components of NURF 

20 include ISWI (a SWI2-related DNA-dependent ATPase, also known as NURF-140), 

NURF-38, NURF-55 and NURF-215. Additional properties of the NURF complex are 
disclosed in Sandaltzopoulos et aL (1999) Meth, Enzymology 304:757-765 and references 
cited tlierein. 

2. CHRAC 

25 CHRAC (Chr omatin Accessibility Complex) possesses ATP-dependent 

nucleosome spacing activity and mediates ATP-dependent accessibility of chromatin to 
restriction endonucleases. Varga Weisze/ a/. (1995) EAiSO 7. 14:2209-2216; Varga 
Weisz et al. (1997) Nature 388:598-602, The CHRAC complex includes the ISWI 
ATPase and four additional polypeptides: pl5, p20, pl75 and DNA topoisomerase 11. 
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3. ACF 

The ACF complex (ATP-utiliziiig chromatin assembly and remodeling 
factor), characterized in Drosophila, is able to facilitate the binding of transcriptional 
activators to chromatin and to affect nucleosome spacing. Ito et al (1997) Cell 90:145- 
5 1 55. ACF contains the ISWI ATPase and three additional polypeptides: pl7, ACFI 
(pl85)andACFn(pl70), 

4. RSF 

The RSF complex (remodeling and spacing factor), found in human cells, 
contains the ISWI homologue hSNF2h and a subunit known as p325. Its activities 
10 include ATP-dependent nucleosome remodeling and spacing. LeRoy et al (1998) 
Science 282:1900-1904. 

5. ISWI 

Chromatin remodeling complexes in yeast, with ATPase submiits 
homologous to ftio JDrosophila ISWI ATPase, include ISWI and ISW2, ISWI contains 
15 the ISWI ATPase subunit, p74, pl05 and pi 10. ISWI has been characterized as 

possessing nucleosome-stimulated ATPase activity and ATP-dependent nucleosome 
disraption and spacing activities. Tsukiyama et al. (1999) Genes Dev, 13:686-697. 

6. ISW2 

The yeast ISW2 complex contains the ISW2 ATPase along with a second 
20 subunit having a molecular weight of 140kD. ISW2 possesses nucleosome-stimulated 
ATPase activity and ATP-dependent nucleosome disruption activity. Tsukiyama et al. 
(1999) supra, 

7. WCRF 

The WCRF chromatin remodeling complex was isolated from human 
25 (HeLa) cells and contains an ISWI-homologous ATPase known as WCRF 135 (SNF2h) 
and a subunit Icnown as WCRF 180, WCRF 1 80 has several hallmarks of a transcription 
factor, including a heterochromatin localization domain, a PHD finger ( a cysteine-rich 
zinc-binding domain) and a bromodomain (a domain reported to be involved in 
interaction withhistones). Bochar et al (2000) Proc. Natl. Acad. ScL USA 97:1038- 
30 1043; Jacobson et al (2000) Science 288:1422-1425. 
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D. Mi-2 coHtaimng complexes 

Chromatin remodeling complexes from humaa (NRDyNURD complex) and 
amphibian cells (Mi-2 complex) contain a nucleosome-dependent ATPase activity called 
Mi-2 (also known as CHD). Additional protein components of the amphibian Mi-2 
5 complex include Mtal-like (a DNA-binding protein homologous to metastasis-associated 
protein), RPD3 (the amphibian homologue of histone deacetylases HDACl and 
HDAC2), RbAp48 (a protein which interacts with histone H4), and MBD3 (a protein 
containing a methylated CpG binding domain). The amphibiaa complex additionally 
contains a serine- and proline-rich subunit, p66. Activities of the amphibian Mi-2 

1 0 complex inchide a nucleosome-dependent ATPase that is not stimulated by free histories 
or DNA, translational movement of histone octamers relative to DNA, and deacetylation 
of core histones within a nucleosome. Guschin et at (2000) Biochenitstry 39:5238-5245. 
Inasmuch as RbAp48 appears to comprise a key structural component of the Mi-2 
complex, it is particularly suitable for fiision with a DNA-binding domain for use in the 

1 5 methods disclosed herein. 

Human NEtD/NURD complexes contain, in addition to Mi-2^ homologues of 
amphibian Mtal-like (MTA-2), RPD3 (HDACl and HDAC2), RbAp48 and MBD3, as 
well as additional proteins, fee Zhang etal (1999) Genes Dev, 13:1924-1935; and 
Komberge^^/. (1999) Curr. Opin. Genet Dev, 9:148-151. 

20 E. DNA methyl transferases and methylated DN A bindin g proteins 

As mentioned above, the methyl-binding-domain protein MBD3 is a component 
of Mi-2-containing chromatia remodeling complexes, MBD3 and related methyl binding 
domain proteins recognize and bind to CpG dinucleotide sequences in which the C 
residue is methylated. Thus MBD proteins are capable of recruiting histone deacetylases 

25 to regions of chromatin rich in methylated CpG. Accordingly, a MBD protein can 
comprise a component of a chromatin remodeling complex. Proteins containing a 
methyl-binding domain iaclude, but are not limited to, MBDl, MBD2, MBD3, MBD4, 
MeCPl andMeCP2. ^ee, for example. Bird a/. (1999) Ce// 99:451-454. 

Additionally, DNA methyl transferases^ which methylate the 5-position of C 

30 residues in CpG dinucleo tides such as, for example, DNMTl, DNIVIT2, DNMT3a and 
DNIV[T3b5 can be used as components of a chromatia remodeliag complex. 
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Not all remodeling complexes have the same activities and the same effects on 
chromatm structure. It is possible that, as more sensitive ^say methods are developed 
and/or more loosely-bound subunits or accessory factors are identified, the various 
chromatin remodeling complexes will be found to possess common activities. 
5 Accordingly, the activities attributed herein to individual chromatin remodeling 
complexes should not be constmed as luniting. 

Nonetheless, it appears, from the information available to date, that each cell type 
contains a multiplicity of chromatin remodeling complexes which can share certain 
common subunits, and that the composition of a chromatin remodeling complex can vary 

10 with cell type. The nmnber of polypeptide subunits in a chromatin remodeling complex 
varies over a wide range, from two in the ISW2 and RSF complexes to over 1 5 in the 
yeast RSC complex. It also appears to be the case that different chromatin remodeling 
complexes can have partially overlapping activities (i.e. , that a degree of functional 
redundancy exists among different chromatin remodehng complexes). The present 

15 disclosure is therefore intended to embrace any and all polypeptides present in any type 
of chromatin remodeling complex, currently known or to be discovered. 

Histone acetylase and deacetylase complexes 

In the process of gene activation, binding of chromatin remodeling complexes to 
20 chromatin generally precedes bindiug of histone acetyl transferase (HAT) and/or histone 
deacetylase (HDAC) complexes, suggesting that HAT and HD AC complexes are 
recmited by the chromatin remodeling complex, or that remodeled cln-omatin is more 
conducive to binding of HAT and HDAC complexes. See^ for example, Cosma et ah 
(1*999) Cell 97:299-311; Krebs et al (1999) Genes Dev. 13:1412-1421. Accordingly, in 
25 one embodiment of the claimed methods, chromatin modification facilitates binding of a 
. HAT- or HDAC-containing complex. In this way, chromatin modification facilitates 
covalent modification of nucleosomal histones by acetylation or deacetylation. Histone 
acetylation is generally correlated with transcriptional activation; while deacetylation of 
histones is generally associated with transcriptional repression. 
30 Numerous HAT en2ymes have been described, including budding yeast GcnSp, 

which is required for expression of a subset of the yeast genome, its mammalian 
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orthologue CREB-binding protein (CBP), p300 (both of the latter two used as 
coactivators by a wide variety of mammalian transcription factors), TAFn250 (a 
component of the basal transcriptional machinery), and steroid receptor coactivator 1 
(SRC-1), which potentiates transcriptional activation by a number of nuclear hormone 
5 receptors, Kouzarides (1999)5w/7ra; Cheung a/. (2000) Curr. Opin. CellBioL 12:326- 
333; and Sterner et al (2000) supra. 

Two major classes of fimctionally distinct HDACs have been identified in higher 
eukaryotes. Class I includes HDACl , HDAC2 and HDACS, which are homologous to 
the yeast Rpd3 histone deacetylase. Class II includes HDAC4, HDAC5 and HDAC6; 

10 and are homologous to the yeast Hdal histone deacetylase. Ng et ah, supra. 

In another embodiment, a ZFP DNA-binding domain is fused to a histone acetyl 
transferase or to a histone deacetylase, to effect chromatin modification in the form of 
covalent modification (acetylation or deacetylation) of histones. In yet another 
embodiment, modification of chromatin by a chromatin remodeling complex is followed 

15 by binding of a ZFP-HAT fusion or a ZFP-HDAC fusion, to establish an active or 
inactive chromatin state, respectively. 

In additional embodiments, a fusion between a DNA-binding domain and a 
protein that is a component of a HAT- or HDAC-containing complex is provided. In this 
way, it is possible to recruit HAT or HDAC activity to a region of interest in cellular 

20 chromatin, depending of the sequence specificity of the DNA-binding domain, HAT- 
and HDAC-containing complexes, and their component polypeptide subunits, have been 
described. See^ for example, Qnmstein (1997) Nature 389:349-352; Hartzog et al 
(1997) Curr, Opin, Genet. De\^el 7:192-198; Kadonaga (1998) C^// 92:307-313; Kuo et 
al (1998) BioEssays 20:615-626; Mizzzen et al (1998) Cell Mol Life Set 54:6^20; 

25 Struhl (1998) Genes Devel 12:599-606; Workman etal (1998) v^/a?^. Rev, Biochem, 
67:545-579; l^^^etal (1999) Trends Biochem, Sci. 25:121-126; and Knoepfler t^r a/. 
(1999) Cell 99:447-450. Accordingly components of HAT- and HDAC-containing 
complexes are well-known to those of skill in the art. 

For example, there are several HAT-oontaining complexes in yeast, one of which 

30 is the SAGA complex (Spt-Ada-Gcn5-acetyltransferase). Grant et al (1997) Genes 
Devel 11:1640-1650; Ikedae? a/. (1999) MoZ. Cell Biol 19:855-863. 
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HDAC-contaioing complexes include the Sin3 complex, which is conserved in 
organisms from yeast to mammals. The components of the yeast Sin3 complex include 
SinSpj RPD3 (a histone deacetylase), RbAp48, and RbAp46. The components of the 
mammalian Sia3 complex include mSinSA, mSinSB, HDACl, HDAC2, RbAp48, 
5 RbAp49, SAP30 and SAP18, Zhang et al (1998) Mol Cell 1:1021-1031. Sin3 proteins 
from yeast, Drosophila, and vertebrates contain a PAH (paired amphipathic helices) 
domain , comprising four conserved repeats which form two amphipathic helices 
separated by a flexible linker. HDACl, HDAC2 and RPD3 are histone deacetylases. 
The RbAp48 and RbAP49 proteins interact with histones. SAP30 and SAP18 are 

1 0 specificity determinants . 

Another HD AC -containing complex (which also possess chromatin remodeUng 
activity, see supra) is the Mi-2 complex* Several Mi-2 complexes liave been described in 
humans and amphibians. The mammalian Mi-2 complex (also known as NuRD) 
comprises the following polypeptides: Mi-2 (also known as CHD), HDACl, HDAC2j 

15 MTA-2 and MBD3. See, for example, Ahringer (2000) Trends Genet 16:351-356. The 
amphibian Mi-2 complex comprises Mi-2, Mtal-like (homologous to mammalian 
MTA2), p66, RbAp48, RPD3 and MBD3. Guschin et al (2000) Biochemistry 39:5238- 
5245. Binding of the methylated DNA binding protein present in this complex (MBD3) 
to methylated CpG dinucleotides in upstream regulatory regions localizes the complex 

20 and its associated HDAC activity to methylated genes. Thus, it is beheved that the Mi-2 
complex is involved in the repression of genes whose upstream DNA is methylated at 
CpG dinucleotides 

Coactivators and corepressors wliich associate with the Sin3 complex to aid in 
targeting and in its interaction with receptors and other transcriptional regulatory proteins 

25 have been described. Examples include, but are not limited to, the vertebrate N-CoR, Rb 
and SMRT proteins and their homologues, as well as the Drosophila SMRTER and 
Groucho proteins and their homologues. For tlie purposes of the present disclosure, such 
coactivators and corepressors are considered to be components of chromatin remodeling 
complexes, inasmuch as they are capable of targeting various types of chromatin 

30 modification, if fused to a DNA-binding domain. 
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For additional details and lists of HAT- and HDAC-containing complexes and 
proteins with which they interact, see Knoepfler et al, supra\ Ng et ah, supra\ and 
Ahringer, supra. 



5 

Hormone receptor functional domains 

The thyroid hormone receptor (TR) is a member of the nuclear hormone receptor 
superfamily and is normally bound constitutively to its target genes. The effect of TR 
binding (i.e., either repression or activation of gene expression) ordinarily depends upon 

10 the presence or absence of its ligand, thyroid hormone (T3). In the absence of T3 tlie 
receptor generally represses gene expression to a level below the basal level. A number 
of proteins have been identified that are recruited by the unUganded receptor and are 
believed to constitute a repressive complex. Examples of such proteins include SMRT 
and NCoR, which interact directly with the receptor, as well as Sin3, which interacts with 

15 SMRT/NCoR. Sia3 also interacts with a number of histone deacetylases, for example, 
HDACs 1 through 8 (some of which may also interact directly with TR). Recruitment of 
histone deacetylases by DNA-bound TR is believed to play a major role in its ability to 
confer repression; however, it is also possible that repressive factors other than HDACs 
are recruited by TR, 

20 Binding of hgand to DNA-bound TR results in the decay of the repressive 

complex associated with the TR and recruitment of activating factors to the DNA-boimd, 
ligand-boiind TR. Such activating factors include, but are not limited to, the histone 
acetyltransferases SRC-1, CBP/p300 and P/CAF. Oligomeric activation complexes can 
also be recruited by ligand-bound TR, such as, for example, DRJP and ARC. Rachez et 

25 aL (1999) Nature 398:824-827; and Naar et al (1999) Nature 398:828-832. These have 
been shown to interact with other nuclear hormone receptors, in response to ligand 
binding, and facilitate activation of gene expression in the context of a chromatin 
template. Another member of the nuclear recq>tor family, the glucocorticoid receptor 
(GR), recruits the hBRGl/BAF chromatin remodehng complex m response to ligand 

30 binding. Fiyer a/. (1998) iVaiwre 393:88-91. 
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TR and related nuclear receptors are modular proteins comprising an amino- 
terminal region (of undefined function), a central DNA binding domain and a carboxy- 
terminal ligand binding domain (LBD). The LED, in addition to binding hormone, is 
responsible for interactions with both the repressive and activating factors described 
5 above. When the LBD is fused to a heterologous DNA binding domain (Gal4), it 

mediates repression of a target promoter containing a Gal4 binding site. CoUingwood et 
al (1998) EMBO J. 17:4760-4770. In addition, T3-dependent activation of transcription 
can be achieved using a fusion of the TR LBD with the Gal4 DNA-binding domain Tone 
etal (1994) Biol Chem, 269:31,157-31461. 

10 Knowledge of the structure of the LBD of TR and related nuclear receptors, 

together with the results of mutagenesis studies, can be used to design mutant receptors 
whose repression and activation activity are impervious to hormone concentration. For 
example, single amino acid mutants of TR that are unable to bind physiological levels of 
T3 {e,g, G344E, A430M, and A276I) recruit corepressors to their bmding site. 

15 Colhngwood et al (1994) Mol Endocrinol 8:1262-1277; Collingwood etal (1998) 
supra. Conversely, mutations causing conformational changes in the ligand binding 
domain that mimic those induced by hormone binding have been identified in the 
estrogen receptor {e.g. L536P and Y541D/E/A) and generate constitutively activating 
forms of the receptor. Ensetal (1997) Mol Cell Biol 17:4644''4653; Whites/ a/. 

20 (1997) EMBO J. 16: 1427-1435. 

Accordingly, a mutant nuclear honnone receptor LBD derived, for example, from 
TR or GR can be used as a component of a fusion with a DNA-binding domain, to recxoiit 
activating or repressing protein complexes to a region of interest in cellular cliromatin. 
Certain naturally-occurring mutant LBDs are available; and new mutants can be 

25 constructed by methods well-known to those of skill in the art. The site of action of such 
complexes is determined by the specificity of the DNA-binding domain; while their 
activity is determined by the nature of the mutation to the LBD and is independent of 
hgand concentration. For instance, a fusion comprising a LBD that has been mutated 
such that it is imable to bind hormone will facilitate fomiation of repressive complexes; 

30 while a fusion molecule comprising a LBD mutation that changes the conformation of the 
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LBD such that it resembles a ligand-bound LBD will stimulate the formation of 
complexes that facilitate transcriptional activation. 

Thus, for the purposes of the present disclosure, a mutant nuclear hormone 
receptor LBD can be considered a component of a chromatin remodeling complex. 

5 

Construction and delivery of fusion molecules 

The methods and compositions disclosed herein include fusion molecules 
comprising a DNA-binding domain and a component of a chromatin remodeling 
complex. The component of a chromatin remodeling complex can be either an enzymatic 

10 component or a non-enzymatic component. Without wishing to be bound by theory, it is 
believed that a fusion molecule comprising an enzymatic component will result in 
modification of a more limited region of cellular chromatin, compared to a fusion 
molecule comprising a non-enzymatic component. This is because, when the enzymatic 
component is directly fused to a DNA-biudtug domain, its activity is regionally restricted 

15 to the vicinity of the target site of the DNA-binding domaiiL (A degree of flexibility 

might be achieved by providing a linker sequence between the enzymatic component and 
the DNA-binding domain.) By contrast, if the fusion molecule comprises a non- 
enzymatic component, there are likely to be several proteins intervening between the 
DNA-binding domain (and, hence, the target site in the chromatin) and the enzymatic 

20 component of the reconstituted chromatin remodeling complex. This potentially allows a 
wider area of action of the enzymatic component, which could result in remodeling of 
more extensive sections of chromatin. 

Fusion molecules are constructed by methods of cloning and biochemical 
conjugation that are well-known to those of skill in the art. Fusion molecules comprise a 

25 DNA-binding domain and a component of a chromatin remodeling complex or a 
functional fragment thereof. Fusion molecules also optionally comprise nuclear 
locaUzation signals (such as, for example, that from the SV40 medium T-antigen) and 
epitope tags (such as, for example, FLAG and hemagglutinin). Fusion proteins (and 
nucleic acids encoding them) are designed such that the translational reading frame is 

30 preserved among the components of the fusion. See Examples 2 and 4, infra for 
additional details on the construction of fusion molecules. 
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Fusions between a polypeptide component of a chromatin remodeling complex 
(or a functional fragment thereof) on the one hand, and a non-protein DNA-binding 
domain {e.g,^ antibiotic, intercalator, minor groove binder, nucleic acid) on the other, are 
constructed by methods of biochemical conjugation known to those of skill in the art. 
5 See, for example, the Pierce Chemical Company (Rockford, IL) Catalogue. Methods and 
compositions for making fusions between a minor groove binder and a polypeptide have 
been described. Mapp etal (2000) Proc. Natl Acad. Set USA 97:3930-3935. 

In certain embodiments, a flision between a polypeptide DNA-binding domain 
and a component of a chromatin remodeling complex (or functional fragment thereof) is 

10 encoded by a fusion nucleic acid. In such cases, the nucleic acid can be cloned into 

intermediate vectors for traasformation into prokaryotic or eukaryotic cells for repUcation 
and/or expression. Intermediate vectors for storage or manipulation of the fusion nucleic 
acid or production of fusion protein can be prokaryotic vectors, {e,g.^ plasmids)j, shuttle 
vectors, insect vectors, or viral vectors for example. A fusion nucleic acid can also 

1 5 cloned into an expression vector, for administration to a bacterial ceU, fungal cell, 

protozoal cell, plant cell, or animal cell, preferably a mammalian cell, more preferably a 
human cell. 

To obtain expression of a cloned fusion nucleic acid, it is typically subcloned into 
an expression vector that contains a promoter to direct transcription. Suitable bacterial 

20 and eukaryotic promoters are well known in the art and described, e.g., in Sambrook et 
aL\ supra; Ausubel et al, supra; and Kriegler, Gene Transfer and Expression: A 
Laboratory Manual (1990). Bacterial expression systems are available in, e.g., E, coli^ 
Bacillus sp., and Salmonella. Palva et al (1983) Gene 22:229-235. Kats for such 
expression systems are commercially available. Eukaryotic expression systems for 

25 mammahan cells, yeast, and insect cells are well known in the art and are also 

commercially available, for example, from Invitrogen, Carlsbad, CA and Clontech^ Palo 
Alto, CA. 

The promoter used to direct expression of a ftision nucleic acid depends on the 
particular application. For example, a strong constitutive promoter is typically used for 
30 expression and purification of a fiision protein. In confrast, when a fusion protein is used 
in vivo, either a constitutive or an inducible promoter is used, depending on the particular 
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use of the fusion protein. In addition, a weak promoter can be used, such as HSV TK or a 
promoter having similar activity. The promoter typically can also include elements that 
are responsive to transactivation, e.g., hypoxia response elements;, Gal4 response 
elements, lac repressor response element^ and small molecule control systems such as tet- 
5 regulated systems and the RU-486 system. See, e.g., Gossen et al (1992) Proc, Natl. 
Acad. Sci USA 89:5547-5551; Oligino et a/.(1998) Gene Ther. 5:491-496; Wang et al 
(1997) Gene Ther. 4:432-441; Neering et al (1996) Blood 88:1147-1155; and Rendahl 
etal {199%) Nat, BiotechnoL 16:757-761. 

In addition to a promoter, an expression vector typically contains a transcription 

10 unit or expression cassette that contains additional elements required for the expression of 
the nucleic acid in host cells, either prokaryotic or eukaryotic. A typical expression 
cassette thus contains a promoter operably linked, e.g., to the fusion nucleic acid 
sequence, and signals required, e.g., for efficient polyadenylation of the transcript, 
transcriptional termination, ribosome binding, and/or translation tertniaation. Additional 

15 elements of the cassette may include, e.g., enhancers, and heterologous spliced intronic 
signals. 

The particular expression vector used to transport the genetic information into the 
cell is selected with regard to the intended use of the fusion polypeptide, e.g, , expression 
in plants, animals, bacteria, fungi, protozoa etc. Standard bacterial expression vectors 
20 include plasmids such as pBR322, pBR322-based plasmids, pSKF, pET23D, and 

commercially available fusion expression systems such as GST and LacZ. Epitope tags 
can also be added to recombinant proteins to provide convenient methods of isolation, for 
monitoring expression, aad for monitoring cellular and subcellular localization, e.g., 
c-myc or FLAG. 

25 Expression vectors containing regulatory elements from eukaryotic vimses are 

often used in eukaryotic expression vectors, e.g., SV40 vectors, papilloma virus vectors, 
and vectors derived firom Epstein-Barr virus. Other exemplaiy eukaryotic vectors include 
pMSG, pAV009/A+, pMTO10/A+, pMAMneo-5, baculovirus pDSVE, and any other 
vector allowing expression of proteins under the direction of the SV40 early promoter, 

30 SV40 late promoter, metallothionein promoter, murine mammary tumor virus promoter. 
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Rous sarcoma virus promoter, polyhedrin promoter, or other promoters shown effective 

for expression in eukaryotic cells. 

Some expression systems have markers for selection of stably transfected cell 

lines such as thymidine kinase, hygromycin B phosphotransferase, and dihydrofolate 
5 reductase. High-yield expression systems are also suitable, such as baculovirus vectors in 

insect cells, with a fusion nucleic acid sequence under the transcriptional control of the 

polyhedrin promoter or any other strong baculovirus promoter. 

Elements that are typically included in expression vectors also include a replicon 

that functions in E. coli (or in the prokaryotic host, if other than E. coli), a selective 
10 marker, e.g., a gene encoding antibiotic resistance, to permit selection of bacteria that 

harbor recombinant plasmids, and unique restriction sites in nonessential regions of the 

vector to allow insertion of recombinant sequences. 

Standard transfection methods can be used to produce bacterial^ manmialian, 

yeast, insect, or other cell lines that express large quantities of fusion protein, which can 
15 be purified, if desired, using standard techniques. See, e.g., Colley et al. (1989) J. BioL 

Chem, 264:17619-17622; and Guide to Protein Purification, in Methods in Enzymology, 

vol. 182 (Deutscher, ed.) 1990. Transformation of eukaiyotic and prokaryotic cells are 

performed according to standard techniques. See, e.g,, Morrison (1977) J. Bacterial, 

132:349-351; Clark-Curtiss et al (1983) mMethods in Enzymology 101:347-362 (Wu et 
20 al, eds). 

Any procedure for introducing foreign nucleotide sequences into host cells can be 
used. These include, but are not limited to, the use of calcium phosphate transfection, 
DEAE-dextran-mediated transfection, polybrene, protoplast fusion, electroporation, lipid- 
mediated deUvery {e,g., liposomes), microinjection, particle bombardment, introduction 

25 of naked DNA, plasmid vectors, viral vectors (both episomal and integrative) and any of 
the other well known methods for introducing cloned genomic DNA, cDNA, synthetic 
DNA or other foreign genetic material into a host cell (see, e.g,^ Sambrook et al, supra). 
It is only necessary that the particular genetic engineering procedure used be capable of 
successfully introducing at least one gene into the host cell capable of expressing the 

3 0 protein of choice. 
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Conventional viral and non-viral based gene transfer methods can be used to 
introduce nucleic acids into mammalian cells or target tissues. Such methods can be used 
to administer nucleic acids encoding fusion polypeptides to cells in vitro. Preferably, 
nucleic acids are administered for in vivo or ex vivo gene therapy uses. Non- viral vector 
5 deUvery systems include DNA plasmids, naked nucleic acid, and nucleic acid complexed 
with a delivery vehicle such as a liposome. Vital vector delivery systems include DNA 
and RNA vimses, which have either episomal or integrated genomes after delivery to the 
celL For reviews of gene therapy procedures, see, for example, Anderson (1992) Science 
256:808-813; Nabel^/a/. (1993) Trends Biotechnol. 11:211-217; Mitanie^a/. (1993) 
10 Trends Biotechnol 11:162-166; T>i\\on (1992) Trends Biotechnol 11:167-175; Miller 
(1992) Nature 357:455-460; Van Brunt (1988) Biotechnology 6(1 0):1 149-1 154; Vigne 
(1995) Restorative Neurology^ and Neuroscience 8:35-36; Kremer et ah (1995) British 
Medical Bulletin 51(l):31-44; Haddada et aL, in Current Topics in Microbiology and 
Immunology^ Doerfler and Bohm (eds), 1995; and Yu et al (1994) Gene Therapy 1 : IS- 
IS 26. 

Methods of non-viral delivery of nucleic acids include lipofection, microinjection, 
balhstics, virosomes, liposomes, inaanunoUposomes, polycation or lipidrnucleic acid 
conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. 
Lipofection is described in, e.g., U.S. Patent Nos. 5,049,386; 4,946,787; and 4,897,355 
20 and lipofection reagents are sold commercially (e.g., Transfectam"^^ and Lipofectin^^. 
Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection 
of polynucleotides include those of Feigner, WO 91/17424 and WO 91/16024. Nucleic 
acid can be delivered to cells {ex vivo administration) or to target tissues {in vivo 
administration). 

25 The preparation of lipidrnucleic acid complexes, including targeted liposomes 

such as immunolipid complexes, is well known to those of skill in the art. See, e,g.^ 
Crystal (1995) Science 270:404-410; Blaese etal (1995) Cancer Gene Ther. 2:291-297; 
Behr et al (1994) Bioconjugate Chem. 5:382-389; Remy et al (1994) Bioconjugate 
Chem. 5:647-654; Gao et al (1995) Gene Therapy 2:710-722; Ahmad et al (1992) 

30 Cancer Res. 52:4817-4^20; and U.S. Patent Nos. 4,186,183; 4,217,344; 4,235,871; 
4,261,975; 4,485,054; 4,501,728; 4,774,085; 4,837,028 and 4,946,787. 
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The use of RNA or DNA virus-based systems for the delivery of nucleic acids 
take advantage of highly evolved processes for targeting a virus to specific cells in the 
body and trafficking the viral payload to the nucleus. Viral vectors can be administered 
directly to patients (in vivo) or they can be used to treat cells in vitro, wherein the 
5 modified cells are administered to patients (ex vivo). Conventional viral based systems 
for the delivery of ZFPs include retroviral, lentivnal, poxviral, adenoviral, adeno- 
associated viral, vesicular stomatitis viral and herpesviral vectors. Integration in the host 
genome is possible with certain viral vectors, including the retrovirus, lentivirus, and 
adeno-associated virus gene transfer methods, often restilting in long term expression of 

10 the inserted trans gene. Additionally, liigh transduction efficiencies have been observed 
in many different cell types aad target tissues. 

The tropism of a retrovirus can be altered by incorporating foreign envelope 
proteins, allowing alteration and/or expansion of the potential target cell population. 
Lentiviral vectors are retroviral vector that arc able to transduce or infect non-dividing 

15 cells and typically produce high viral titers. Selection of a retroviral gene transfer system 
would therefore depend on the target tissue. Retroviral vectors have a packaging capacity 
of up to 6-10 kb of foreign sequence and are comprised of cis-suctmg long terminal 
repeats (LTRs). The minimum cfc-acting LTRs are sufficient for replication and 
packaging of the vectors, which are then used to integrate the therapeutic gene into the 

20 target cell to provide permanent transgene expression. Widely used retroviral vectors 
include those based upon murine leulcemia virus (MuLV), gibbon ape leukemia virus 
(GaLV), simian immunodeficiency virus (SIV), luiman inmiunodeficiency virus (HIV), 
and combinations thereof. Buchscher a/. (1992) ViroL 66:2731-2739; Johaxm etal. 
(1992) J: Virol. 66:1635-1640; Sommerfelt a/. (1990) ViroL 176:58-59; Wilson etaL 

25 (1989) Virol. 63:2374-2378; Miller e^^i. (1991)^. ViroL 65:2220-2224; and 
PCT/US94/05700). 

Adeno-associated virus (AAV) vectors are also used to transduce cells with target 
nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo 
and ex vivo gene therapy procedures. See, e.g,. West et al (1987) Virology 160:38-47; 
30 U.S. Patent No. 4,797,368; WO 93/24641; Kotm (1994) Hum. Gene Ther. 5:793-801; 
and Muzyczka (1994) J. Clin. Invest 94:1351. Construction of recombinant AAV 
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vectors are described in a number of publications, including U.S. Patent No. 5,173,414; 
TratscWn ^zZ. (1985) Mo/. Cell Biol 5:3251-3260; TratscMn, (1984) Afo/. Cell 
Biol 4:2072-2081; Hermonat ei^ a/. (1984) Prac. Natl Acad. Set USA 81:6466-6470; 
and Samulski etal (1989) J: Virol 63:3822-3828. 
5 Recombinant adeno-associated virus vectors based on the defective and 

nonpathogenic parvovirus adeno-associated vims type 2 (AAV-2) are a promising gene 
delivery system. Exemplary AAV vectors are derived from a plasmid conta inin g the 
AAV 145 bp inverted temiinal repeats flanking a transgene expression cassette. Efficient 
gene transfer and stable transgene delivery due to integration into the genomes of the 

10 transduced cell are key featiues for this vector system. Wagner et al (1998) Lancet 
351©(9117): 1702-3; and Keams et al (1996) Gene Ther. 9:748-55. 

pLASTSf and MFG-S are examples are retroviral vectors that have been used in 
clinical trials. Dunbar a/. (1995) 85:3048-305; Koha et al (1995) Nature Med, 

1:1017-102; Maleche^a/. {1991) Proc, Natl Acad Set 2754 94:12133-12138. 

1 5 PA3 1 7/pLASN was the first therapeutic vector used in a gene therapy trial. (Blaese et al 
(1995) Science 270:475-480. Transduction efficiencies of 50% or greater have been 
observed for MFG-S packaged vectors. EUem et al (1997) Immunol Immunother. 
44(l):10-20; Dranoffe/a/. (1997) Hum. GeneTher. 1:111-2. 

In applications for which transient expression is preferred, adenoviral-based 

20 systems are useful. Adenoviral based vectors are capable of very high transduction 

efficiency in many cell types and are capable of infecting, and hence dehvering nucleic 
acid to, both dividing and non-dividing cells. With such vectors, high titers and levels of 
expression have been obtained. Adenovirus vectors can be produced in large quantities 
in a relatively simple system. 

25 Replication-deficient recombinant adenoviral (Ad) can be produced at high titer 

and they readily infect a number of different cell types. Most adenovirus vectors are 
engineered such that a transgene replaces the Ad El a, Elb, and/or E3 genes; the 
replication defector vector is propagated in human 293 cells that supply the required El 
functions in trans. Ad vectors can transduce multiple types of tissues in vivo, including 

30 non-dividing, differentiated cells such as those found in the liver, kidney and muscle. 
Conventional Ad vectors have a large carrying capacity for inserted DNA. An example 
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of the use of an Ad vector in a clinical trial involved polynucleotide therapy for antitumor 
immimization with intramuscvilar injection. Stennan et al (1998) Hum, Gene Ther, 
7:1083-1089. Additional exaniples of the use of adenovirus vectors for gene transfer in 
clinical trials include Rosenecker et al. (1996) Infection 24:5-10; Stemian et ah, supra\ 
5 Welsh et al (1995) Hum. Gene Ther. 2:205-218; Alvarez et al (1997) Hum, Gene Ther, 
5:597-613; and Topf al (1998) Gene Ther. 5:507-513. 

Packaging cells are used to form virus particles that are capable of infecting a host 
cell. Such ceils include 293 cells, which package adenovirus, and \|/2 cells or PAS 17 
cells, which package retroviruses. Viral vectors used in gene therapy are usually 

10 generated by a producer cell line that packages a nucleic acid vector into a viral particle. 
The vectors t3/pically contain the minimal viral sequences required for packaging and 
subsequent integration into a host, other viral sequences being replaced by an expression 
cassette for the protein to be expressed. Missing viral functions are suppUed in trans^ if 
necessary, by the packaging ceU line. For example, AAV vectors used in gene therapy 

15 typically only possess ITR sequences from the AAV genome, which are required for 
packaging and integration into the host genome. Viral DNA is packaged in a cell line, 
which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but 
lacking ETR sequences. The cell line is also infected with adenovirus as a helper. The 
helper virus promotes replication of the AAV vector and expression of AAV genes from 

20 the helper plasmid. The helper plasmid is not packaged in significant amounts due to a 
lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g,^ heat 
treatment, which preferentially mactivates adenovinises. 

In many gene therapy applications, it is desirable that the gene therapy vector be 
delivered with a high degree of specificity to a particular tissue type. A viral vector can 

25 be modified to have specificity for a given cell type by expressing a hgand as a fusion 

protein with a viral coat protein on the outer surface of the virus. The ligand is chosen to 
have affinity for a receptor known to be present on the cell type of interest. For example, 
Han et al (1995) Proc, Natl Acad. Set USA 92:9747-9751 reported that Moloney 
murine leukemia virus can be modified to express human hereguUn fiised to gp70, and 

30 the recombinant vims infects certaiu human breast cancer cells expressing human 

epidermal growth factor receptor. This principle can be extended to other pairs of vims 
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expressing a ligand fusion protein and target cell expressing a receptor. For example, 
filamentous phage can be engineered to display antibody fragments (e.g.. Fab or Fv) 
having specific binding affinity for virtually any chosen cellular receptor. Although the 
above description appUes primarily to viral vectors, the same principles can be applied to 
5 non-viral vectors. Such vectors can be engineered to contain specific uptake sequences 
thought to favor uptake by specific target cells. 

Gene therapy vectors can be delivered in vivo by administration to an individual 
patient, typically by systemic administration (e.g,, intravenous, intraperitoneal, 
intramuscular, subdermal, or intracranial infusion) or topical application, as described 

10 infra. Alternatively, vectors can be delivered to cells ex vivo, such as cells explanted 
from an individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy) ox 
universal donor hematopoietic stem cells, followed by reimplantation of the cells into a 
patient, usually after selection for cells which have incorporated the vector. 

Ex vivo cell transfection for diagnostics, research, or for gene therapy (e.g., via re- 

15 infusion of the transfected cells into the host organism) is well known to those of skill in 
the art. hi a preferred embodiment, cells are isolated from ttie subject organism, 
transfected with a nucleic acid (gene or cDNA), and re-infiised back into ttie subject 
organism (e.g., patient). Various cell types suitable for ex vivo transfection are well 
known to those of skill in the art. See, e.g., Freshney et al. Culture of Animal Cells, A 

20 Manual of Basic Technique, 3rd ed., 1994, and references cited therein, for a discussion 
of isolation and cultm'e of cells from patients. 

In one embodiment, hematopoietic stem cells are used in ex vivo procedures for 
cell transfection and gene therapy. The advantage to using stem cells is that they can be 
differentiated into other cell types in vitro, or can be introduced into a mammal (such as 

25 the donor of the cells) where they will engraft in the bone marrow. Methods for 

differentiating CD34+ stem ceUs in vitro into clinically important immune cell types 
using cytokines such a GM-CSF, IFN-y and TNF-y are known. Inaba et al (1 992) J. 
Exp. Med, 176:1693-1702. 

Stem cells are isolated for transduction and differentiation using laiown methods. 

30 For example, stem cells are isolated from bone marrow cells by panning the bone marrow 
cells with antibodies which bind unwanted cells, such as CD4+ and CDS+ (T cells). 
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CD45+ (paDB cells), GR-1 (granulocj^es), and lad (differentiated antigen presenting 
cells). See Inaba et ah, supra. 

Vectors (e.g., retroviruses, adenovirases, liposomes, etc) containing therapeutic 
nucleic acids can be also administered directly to the organism for transduction of cells in 
5 vivo. Altematively, naked DNA can be administered. Administration is by any of the 
routes normally used for introducing a molecule into ultimate contact with blood or tissue 
cells. Suitable methods of administering such nucleic acids are available and well known 
to those of skill in the art, and, although more than one route can be used to administer a 
particular composition,, a particular route can often provide a more immediate and more 

1 0 effective reaction than another route. 

Phamiaceutically acceptable carriers are determined in part by the particular 
composition being administered, as well as by the particular method used to administer 
the composition. Accordingly, there are a wide variety of suitable formulations of 
pharmaceutical compositions of the present disclosure, as described below. See, e.g., 

1 5 Remington 's Pharmaceutical Sciences^ 17th ed., 1989. 

Delivery vehicles 

In certain embodiments, one or more polypeptides, comprising a fusion between a 
DNA-binding domain and a component of a chromatin remodeling complex, can be 

20 introduced into a cell. An important factor in the administration of polypeptide 
compounds is ensuring that the polypeptide has the ability to traverse the plasma 
membrane of a cell, or the membrane of an intra-cellular compartment such as the 
nucleus. Cellular membranes are composed of lipid-protein bilayers that are freely 
permeable to small, nonionic lipophilic compounds and are inherently impermeable to 

25 polar compoimds, macromolecules, and therapeutic or diagnostic agents. However, 

proteins, Upids and other compounds, which have the ability to translocate polypeptides 
across a cell membrane, have been described. 

For example, "membrane translocation polypeptides" have amphiphilic or 
hydrophobic amino acid subsequences that have the ability to act as membrane- 

30 translocating carriers. In one embodiment, homeodomain proteins have the abihty to 

translocate across cell membranes. The shortest intemalizable peptide of a homeodomain 
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protein, Antennapedia, was found to be the third helix of the protein, jfrom amino acid 
position 43 to 58. Prochiantz (1996) Curr. Opin. Neurobiol 6:629-634, Another 
subsequence, the h (hydrophobic) domain of signal peptides, was found to have similar 
cell membrane translocation characteristics. Uin^etal (1995) J. Biol. Chem. 270:14255- 
5 14258. 

Examples of peptide sequences which can be linked to a fusion polypeptide for 
facilitating its uptake into cells include, but are not limited to: an 11 amino acid peptide 
of the tatij^rotem of HIV; a 20 residue peptide sequence which corresponds to amino 
acids 84-103 of the p 16 protein {see Fahraeus et al (1996) Curr, Biol 6:84); the third 

10 helix of the 60-amino acid long homeodomain of Antennapedia (Derossi et al (1994) J. 
Biol Chein. 269:10444); the h region of a signal peptide, such as the Kaposi fibroblast 
growth factor (K-FGF) h region (Lin et al, supra); and the VP22 translocation domain 
from HSV (Elliot et al (1997) Cell 88:223-233). Other suitable chemical moieties that 
provide enhanced cellular uptake can also be linked, either covalently or non-covalently, 

15 to fusion polypeptides. 

Toxin molecules also have the ability to transport polypeptides across cell 
membranes. Often, such molecules (called "binary toxins") are composed of at least two 
parts: a translocation or binding domain and a separate toxin domain. Typically, the 
translocation domain, which can optionally be a polypeptide, binds to a cellular receptor, 

20 facilitating transport of the toxin into the cell. Several bacterial toxins, including 

Clostridium perfringens iota toxin, diphtheria toxin (DT), Pseudomonas exotoxin A (PE), 
pertussis toxin (PT), Bacillus anthracis toxin, and pertussis adenylate cyclase (CYA), 
have been used to deliver peptides to the cell cytosol as internal or amino-terminal 
fusions. Axovdietal (1993) J. Biol Chem. 268:3334-3341; Perellee/a/. (1993) Infect 

25 Immun. 61:5147-5156; Stenniark a/. (1991)/. CellBioL 113:1025-1032; Domielly 
al (1993) Proc. Natl Acad, Set 90:3530-3534; Carbonetti a/. (1995) ^^^^r. 
Annu. Meet. Am. Sac. Microbiol 95:295; Sebo et al (1995) Infect. Immun. 63:3851- 
3857; Klimpel £z/. (1992) iVoa iV^zrf. ^catf. 5cz. L^^^ and Novak 

al (1992) J: Biol Chem. 267:17186-17193. 

30 Such subsequences can be used to translocate polypeptides, including fusion 

polypeptides as disclosed herein, across a cell membrane. This is accompHshed, for 
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example, by derivatizing the fusion polypeptide with one of these translocation 
sequences, or by forming an additional fusion of the translocation sequence with the 
fusion polypeptide. Optionally, a linker can be used to link the fusion polypeptide and the 
translocation sequence. Any suitable linker can be used, e.g., a peptide linker. 
5 A fusion polypeptide can also be introduced into an animal cell, preferably a 

mammalian cell, via liposomes and Uposome derivatives such as immunoliposomes. The 
term "liposome" refers to vesicles comprised of one or more concentrically ordered lipid 
bilayers, which encapsulate an aqueous phase. The aqueous phase typically contains the 
compound to be delivered to the cell. 

10 The liposome fuses with the plasma membrane, thereby releasing the compoimd 

into the cytosol. Alternatively, the liposome is phagocytosed or taken up by the cell in a 
transport vesicle. Once in the endosome or phagosome, the liposome is either degraded 
or it fuses with the membrane of the transport vesicle and releases its contents. 

In current methods of drug delivery via liposomes, the liposome ultimately 

1 5 becomes permeable and releases the encapsulated compound at the target tissue or cell. 
For systemic or tissue specific delivery, this can be accomplished, for example, in a 
passive mamier wherein the liposome bilayer is degraded over time through the action of 
various agents in the body. Alternatively, active drug release involves ushig an agent to 
induce a permeability change in the liposome vesicle. Liposome membranes can be 

20 constructed so that they become destabilized when the environment becomes acidic near 
the liposome membrane. See, e.g.. Proa Natl. Acad, ScL USA 84:7851 (1987); 
Biochemistry 28:908 (1989). When liposomes are endocytosed by a target cell, for 
example, they become destabilized and release their contents. This destabihzation is 
termed fusogenesis. Dioleoylphosphatidylethanolamine (DOPE) is the basis of many 

25 "fusogenic" systems. 

For use with the methods and compositions disclosed herein, liposomes typically 
comprise a fusion polypeptide as disclosed herein, a lipid component, e.g., a neutral 
and/or cationic lipidj and optionally include a receptor-recognition molecule such as an 
antibody that binds to a predetermined cell surface receptor or ligand {e.g., an antigen). 

30 A variety of methods are available for preparing liposomes as described in^ e.g. ; 

U.S. Patent Nos. 4,186,183; 4,217,344; 4,235,871; 4,261,975; 4,485,054; 4,501,728; 
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4J74,085iL4,8j37,028; 4,235,871; 4,261,975; 4,485,054; 4,501,728; 4,774,085; 
4,837,028; 4,946,787; PCT Publication No. WO 91/17424; Szokei et aL (1980) Ann, 
Rev, Biophys, Bioeng, 9:467; Deamer et al (1976) Biochim. Biophys, Acta 443:629-634; 
Fraley, et al (1979) Proc, Natl Acad. Set USA 76:3348-3352; Hope et al (1985) 
5 Biochim, Biophys. Acta 812:55-65; Mayer et al (1986) Biochim. Biophys. Acta 858:161- 
168; Williams et al (1988) Proc. Natl Acad. Set USA ^SilAl-lAS', Liposomes, Ostro 
(ed.), 1983, Chapter 1); Hope etal (1986) Chem, Phys. Lip. 40:89; Gregoriadis, 
Liposome Technology (1984) and Lasic, Liposomes: from Physics to Applications (1993). 
Suitable methods include, for example, sonication, extrusion, high 
10 pressure/homogenization, microfluidization, detergent dialysis, calcium-induced fusion 
of small Uposome vesicles and ether-fusion methods, all of wliich are well known in the 
art. 

In cerrtain embodiments, it may be desirable to target a liposome using targeting 
moieties fliat are specific to a particular cell type, tissue, and the like. Targeting of 
15 Uposomes using a variety of targeting moieties (e.g., ligands, receptors, and monoclonal 
antibodies) has been previously described. See, e.g., U.S. Patent Nos. 4,957,773 and 
4,603,044: 

Examples of targeting moieties include monoclonal antibodies specific to antigeoas 
associated with neoplasms, such as prostate cancer specific antigen and MAGE. Tumors 

20 can also be diagnosed by detecting gene products resulting from the activation or over- 
expression of oncogenes, such as ras or c-erbB2. In addition, many tumors express 
antigens nomially expressed by fetal tissue, such as the alphafetoprotein (AFP) and 
carcinoembryonic antigen (CEA). Sites of viral infection can be diagnosed using various 
viral antigens such as hepatitis B core and surface antigens (HBVc, HBVs) hepatitis C 

25 antigens, Epstein-Barr virus antigens, human immunodeficiency type-1 virus (HIV-1) 

and papilloma virus antigens. Inflammation can be detected using molecules specifically 
recognized by surface molecules which are expressed at sites of inflammation such as 
integrins (e.g., VCAM-1), selectin receptors (e.g., ELAM-1) and the like. 

Standard methods for coupling targeting agents to liposomes are used. These 

30 methods generally involve the incorporation into liposomes of lipid components, e.g-., 
phosphatidylethanolamine, which can be activated for attachment of targeting agents, or 
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incoxporation of derivatized lipophilic compounds^, such as lipid derivatdzed bleomycin. 
Antibody targeted Uposomes can be constructed using, for instance, liposomes which 
incorporate protein A, See Rermeisen et aL (l990)J.BioL Chem. 265:16337-16342 and 
Leonetti et al (1990) Proc, Natl Acad. Sci, USA 87:2448-2451, 

5 

Pharmaceutical compositions and administration 

Fusion polypeptides as disclosed herein, and expression vectors encoding fusion 
polypeptides, can be used in conjunction with various methods of gene therapy to 
facilitate the action of a therapeutic gene product. In such applications, a fusion 

10 poljTpeptide can be administered directly to a patient, e.g., to facilitate the modulation of 
gene expression and for therapeutic or prophylactic applications, for example, cancer, 
ischemia, diabetic retinopathy, macular degeneration, rheumatoid arthritis, psoriasis, HIV 
infection, sickle cell anemia, Alzheimer's disease, muscular dystrophy, 
neurodegenerative diseases, vascular disease, cystic fibrosis, stroke, and the like. 

15 Examples of microorganisms whose inhibition can be facilitated through use of the 

methods and compositions disclosed herein include pathogenic bacteria, e.g.. Chlamydia, 
Rickettsial bacteria, Mycobacteria, Staphylococci, Streptococci, Pneumococci, 
Meningococci and Conococci, Klebsiella, Proteus, Serratia, Pseudomonas, Legionella, 
Diphtheria, Salmonella, Bacilli (e.g., anthrax), Vibrio (e.g., cholera), Clostridium (e.g., 

20 tetanus, botulism), Yersinia (e.g., plague), Leptospirosis, and Borrellia (e.g., Lyme 

disease bacteria); infectious fungus, e.g., Aspergillus, Candida species; protozoa such as 
sporozoa (e.g., Plasmodia), rhizopods (e.g., Ejitamoeba) and flagellates (Tiypanosoma^ 
Leishmania, Trichomonas, Giardia, e^c.);viruses, e.g., hepatitis (A, B, or C), herpes 
viruses (e.g., VZV, HSV-1, HHV-6, HSV-II, CMV, and EBV), HIV, Ebola, Marburg and 

25 related hemorrhagic fever-causing viruses, adenoviruses, influenza vinxses, flaviviruses, 
echoviruses, rhinoviruses, coxsackie viruses, comaviruses, respiratory syncytial viruses, 
mumps vimses, rotaviruses, measles viruses, rubella viruses, parvoviruses, vaccinia 
viruses, HTLV viruses, retroviruses, lentiviruses, dengue viruses, papillomaviruses, 
polioviruses, rabies viruses, and arboviral encephalitis viruses, etc. 

30 Administration of therapeutically effective amounts of a fusion polypeptide or a 

nucleic acid encoding a fusion polypeptide is by any of the routes normally used for 
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iBtroducing polypeptides or nucleic acids into ultimate contact with the tissue to be 
treated. The fusion polypqptides or nucleic acids are administered in any suitable 
manner, preferably with pharmaceutically acceptable carriers. Suitable methods of 
administering such modulators are available and well known to those of skill in the art, 
5 and, although more than one route can be used to administer a particular composition, a 
particular route can oJften provide a more immediate and more effective reaction than 
another route. 

Pharmaceutically acceptable carriers are determined in part by the particular 
composition being administered, as well as by the particular method used to administer 
10 the composition. Accordingly, there are a wide variety of suitable formulations of 

pharmaceutical compositions. See, e.g.. Remington's Pharmaceutical Sciences, 17^' ed. 
1985. 

Fxxsion polypeptides or nucleic acids, alone or m combination with other suitable 
components, can be made into aerosol formulations (z.e., they can be "nebulized") to be 

15 administered via inhalation. Aerosol formulations can be placed into pressurized 

acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the tike. 

Formulations suitable for parenteral administration, such as, for example, by 
intravenous, intramuscular, intrademial, and subcutaneous routes, include aqueous and 
non-aqueous, isotonic sterile injection solutions, which can contain antioxidants, buffers, 

20 bacteriostats, and solutes that render the formulation isotonic with the blood of the 

intended recipient, and aqueous and non-aqueous sterile suspensions that can include 
suspending agents, solubilizers, thickening agents, stabilizers, and preservatives. 
Compositions can be administered, for example, by intravenous infusion, orally, 
topically, intraperitoneally, intravesically or intrathecally. The formulations of 

25 compoimds can be presented in unit-dose or multi-dose sealed containers, such as 

ampoules and vials. Injection solutions and suspensions can be prepared Jfrom sterile 
powders, granules, and tablets of the kind known to those of skiU in the art. 



Assays for chromatin remodeling 

3 0 Nimierous activities of chromatin remodeling complexes have been described, 

including but not limited to the following. A characteristic activity of all chromatin 
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remodeling complexes is nucleosome- or DNA-dependent ATPase activity. Chromatin 
remodeling complexes can facilitate binding of transcription factors to genes in a 
chromatin context and facilitate accessibility of sequences in chromatin to restriction 
enzymes and other nucleases. Certain remodeling complexes (those containing the IS WI 
5 ATPase) also possess the ability to assemble periodic nucleosome arrays {i.e. they are 
capable of spacing nucleosomes). Changes in DNA topology {i.e., degree of 
supercoiling) can also result &om the action of chromatin remodeling complexes; these 
are believed to reflect either alterations of the path of DNA along the nucleosome or 
alterations in the path of linker DNA along the chromatin fiber. Chromatin remodehng 

10 complexes are also capable of transferring histones firom chromatin to either DNA or 

protein acceptors. Stimulation of transcription initiation can also result from the action of 
chromatin remodeling complexes. 

Without wishing to be bound by any particular theory, the inventors recognize the 
possibility that the mechanism underlying all the above-mentioned activities may be the 

15 ability of chromatin remodeling complexes to promote nucleosome sliding or, more 
basically, to destabilize the histone-DNA interaction. Accordingly, any protein or 
multiprotein complex capable of destabilizing histone-DNA interactions and/or 
promoting nucleosome movement is suitable for use as a component of a chromatin 
remodeling complex. 

20 The various activities of chromatin remodeling complexes can be assayed by a 

number of techniques, as are known to those of skill in the art, and as have been 
described in publications disclosing the isolation and characterization of the various 
chromatin remodeling complexes, as set for^ supra. See also Imblazano etal (1994) 
Nature 370:481-485 and Cote et al (1993) Science 265:53-60 for descriptions of assays 

25 involving facilitation of transcription factor binding. Assays involving nucleosome 

repositioning are described by, for example, Hamiche et aL (1999) Cell 97:833-842 and ' 
Guschin et al (2000) Biochemistry 39:5238-5245. Accordingly, it is possible for one of 
skill in the art to determine whether a given multiprotein complex is a chromatin 
remodeling complex and to determine whether a particular polypeptide is a component of 

30 a chromatin remodeling complex or functional fegment thereof. Additional examples of 
assays for cltromatin remodeling activity are provided injra and in publications such as 
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Methods in Enzymology, Vol. 304, "Chromatin" (P.M. Wassarman and A. P. Wolffe, 
eds.). Academic Press, San Diego, 1999; and Methods in Molecular Biology, Vol. 119, 
"Chromatin Protocols" (P.B. Becker, ed.) Humana Press, Totowa, 1999. See also U.S, 
Patent No. 5,972,608. 

5 An additional assay for chromatin modification is modulation of gene expression, 

when the modification is part of a two-step process in which chromatin modification 
allows binding of a molecule which modulates gene expression {e.g,^ a polypeptide 
comprising a fusion between a zinc finger DNA-binding domain and a transcriptional 
regulatory domain). Assays for gene modulation (e.g., transcriptional activation and/or 
10 repression, reporter gene activity^ measurement of protein levels) are well-known to those 
of skill in the art and are described, for example, in co-owned WO 00/41566. 

Applications 

The compositions and methods disclosed herein can be used to facilitate a number 
1 5 of processes involving cellular chromatin. These processes include, but are not limited 
to, transcription, replication, recombination, repair, integration, maintenance of 
telomeres, and processes involved in chromosome stability and disjunction. Accordingly, 
the methods and compositions disclosed herein can be used to affect any of these 
processes, as well as any other process which can be influenced by chromatin structure 
20 such as, for example, detection of specific sequences or sequence variants in cellular 
chromatin. 

Targeted modification of chromatin structm-e, as disclosed herein, can be used in 
processes such as, for example, therapeutic regulation of disease-related genes, 
engineering of cells for manufacture of protein pharmaceuticals, pharmaceutical 

25 discovery (including target discovery, target validation and engineering of cells for high 
throughput screening methods) and plant agriculture. 

For example, in one embodiment, chromatin modification facilitates access of one 
or more transcriptional regulatory factors to a target site in cellular chromatin, thereby 
participating in modulation of gene expression. Modulation of gene expression can 

30 include either increases or decreases in the level of gene expression. In another 
exemplary embodiment, chromatin modification increases the efficiency of 
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recombination, thereby facilitating, for example, targeted integi"ation of aa exogenous 
nucleic acid. 

For the purposes of the present disclosxu-e, chromatin includes any cellular 
nucleoprotein structure. This can include, but is not limited to chromosomes (z.e., nuclear 
5 genomes), episomes, organellar nucleoproteins, such as mitochondrial and chloroplast 
genomes, and nucleoproteins associated with infecting bacterial or viral genomes. It is 
known that non-eukaryotic genomes are organized into nucleoprotein structures. In 
eukaryotic cells, the genome is enclosed in the nucleus. Accordingly, contact of a 
molecule with cellular chromatin includes introduction of the molecule into the nucleus 
10 of a cell. 

Cells include, but are not limited to, prokaryotic, eukaayotic and Archaeal cells. 
Eukaryotic cells include plant, fungal, protozoal and animal cells, including mammalian 
cells, primate cells and human cells. 

In one embodiment, modification of chromatin is used to facilitate the modulation 

15 of gene expression. Modulation can include gene activation aud gene repression, as well 
as more subtle increases or decreases in the level of gene expression. Activation of gene 
expression can be mediated, for instance, by the activity of a histone acetyl transferase 
that has been recruited to a region of interest by the methods and compositions disclosed 
herein. Repression of gene expression can be mediated, for iastance, by the activity of a 

20 histone deacetylase that has been recruited to a region of interest by the methods and 

compositions disclosed herein. Without wishing to be bound by any particular theory, it 
is believed that modification of chromatin in the vicinity of a particular gene will make 
that gene's regulatory sequences more (or less^ in the case of repression) accessible to 
transcriptional activators. Alternatively^ chromatiu modification could render regulatory 

25 sequences more accessible to transcriptional repressors or less accessible to positive 
transcriptional regulatory factors. 

Accordiagly, expression of any gene in any organism can be modulated by 
chromatia modification as disclosed herein, including therapeutically relevant genes, 
genes of infecting microorganisms, viral genes, and genes whose expression is modulated 

30 in the process of target validation. Such genes include, but are not limited to, vascular 
endothelial growth factor (VEGF), VEGF receptors fit isnAjlk, CCR-5, low density 
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lipoprotedn receptor (LDLR), estrogen receptor, HER-2/neu, BRCA-1, BRCA-2, 
phosphoenolpymvate carboxykinase (PEPCK), CYP?, fibrinogen, apolipoprotein A 
(ApoA), apolipoprotein B (ApoB), renin, phosphoenolpymvate carboxykinase (PEPCK), 
CYP7, fibrinogen, nuclear factor icB (NF-kB), inhibitor of NF-kB (I-kB), tumor necrosis 
5 factors (e.g., TNF-a, TNF-P), interleukin-l (IL-1), FAS (CD95), FAS ligand (CD95L), 
atrial natriuretic factor, platelet-derived factor (PDF), amyloid precvirsor protein (APP), 
t5T0sinase, tyrosine hydroxylase, P-aspartyl hydroxylase, alkaline phosphatase, calpains 
(e.g., CAPNIO) neuronal pentraxin receptor, adriamycin response protein, apolipoprotein 
E (apoB), leptin, leptin receptor, UCP-1, IL-1, IL-1 receptor, IL-2, IL-3, IL-4, IL-5, IL-6, 

10 IL-12, IL-1 5, interleukin receptors, G-CSF, GM-CSF, colony stimulating factor, 

erythropoietin (EPO), platelet-derived growth factor CPDGF), PDGF receptor, fibroblast 
growth factor (FGF), FGF receptor, PAF, pl6, pl9, p53, Rb, p21, myc, myb, globin, 
dystrophin, eutrophin, cystic fibrosis transmembrane conductance regulator (CFTR), 
GNDF, nerve growth factor (NGF), NGF receptor, epidermal growth factor (EGF), EGF 

15 receptor, transforming growth factors (e.g., TGF-a, TGF-p), fibroblast growth factor 
(FGF), interferons (e.g, IFN-a, IFN-p and IFN-y), insulin-related growth factor- 1 
(IGF-1), angiostatiu, ICAM-1, signal transducer and activator of transcription (STAT), 
androgen receptors, e-cadherin, cathepsins (e.g., cathepsin W), topoisomerase, 
telomerase, bcl, bcl-2^ Box, T Cell-specific tyrosine kinase (LcK), p38 mitogen-activated 

20 protein kinase, protein tyrosine phosphatase (hPTP), adenylate cyclase, guanylate 
cyclase, a7 neuronal nicotinic acetylcholine receptor, 5-hydroxytryptamine 
(serotoiiin)-2A receptor, transcription elongation factor-3 (TEF-3), phosphatidylcholine 
transferase, ^z, PTI-1, polygalacturonase, EPSP synthase, FAD2-1, A-9 desaturase, A-12 
desaturase. A- 15 desaturase, acetyl-Coenzyme A carboxylase, acyl-ACP thioesterase, 

25 ADP-glucose pyrophosphorylase, starch synthase, cellulose synthase, sucrose synthase, 
fatty acid hydroperoxide lyase, and peroxisome prohferator-activated receptors, such as 
PPAR-Y2, 

Expression of human, mammalian, bacterial, fungal, protozoal Archaeal, plant 
and viral genes can be modulated; viral genes include, but are not limited to, hepatitis 
30 vims genes such as, for example, HBV-C, HBV-S, HBV-X and HBV-P; and HIV genes 
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such as, for example, tat and rev. Modulation of expression of genes encoding antigens 
of a pathogenic organism can be achieved using the disclosed methods and compositions. 

Additional genes include those encoding cytokines, lymphokines, interleukins, 
growth factors, mitogenic factors, apoptotic factors, cytochromes, chemotactic factors, 
5 chemokine receptors {e.g., CCR-2, CCR-3, CCR-5, CXCR-4), phospholipases (e.^., 
phospholipase C), nuclear receptors, retinoid receptors^ organellar receptors, hormones, 
hormone receptors, oncogenes, tumor suppressors, cycHns, cell cycle checkpoint proteins 
(e.^.,Chkl, Clik2)5 senescence-associated genes, immunoglobulins, genes encoding heavy 
metal chelators, protein tyrosine kinases, protein tyrosine phosphatases, tumor necrosis 

10 factor receptor-associated factors (e.g,, Traf-3, Traf-6), apolipoproteins, thrombic factors, 
vasoactive factors, neuroreceptors, cell smface receptors, G-proteins, G-protein-coupled 
receptors {e.g.^ substance K receptor, angiotensin receptor, a- and p-adrenergic receptors, 
serotonin receptors, and PAF receptor), muscarinic receptors, acetylcholine receptors, 
GAB A receptors, glutamate receptors, dopamine receptors, adhesion proteins (e.g,^ 

15 CAMs, selectins, integrins and immunoglobulin superfamily members), ion channels, 
receptor-associated factors, hematopoietic factors, transcription factors, and molecules 
involved in signal transduction. Expression of disease-related genes, and/or of one or 
more genes speciJBc to a particular tissue or cell type such as, for example, brain, muscle, 
heart, nervous system, circulatory system, reproductive system, genitourinaty system, 

20 digestive system and respiratory system can also be modulated. 

The fusion molecules disclosed herein comprise a DNA-binding domain which 
binds to a target site. In certain embodiments, the target site is present in an accessible 
region of cellular chromatin. Accessible regions can be determined as described in co- 
owned U.S. Patent Application entitled "Databases of Accessible Region Sequences; 

25 Methods of Preparation and Use Thereof reference SI 5, filed even date herewith. In 
additional embodiments, the DNA-binding domain of a fusion molecule is capable of 
binding to cellular chromatin regardless of whether its target site is in an accessible 
region or not. For example, such DNA-binding domains are capable of binding to linker 
DNA and/or nucleosomal DNA. Examples of this type of **pioneer" DNA binding 

30 domain are found in certain steroid receptor and in hepatocyte nuclear factor 3 (HNF3). 
Cordingley et al, supra; Pina et aL, supra; and Cirillo et al, supra. 
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Methods of chromatin modification in a region of interest can be combined with 
methods involving binding of endogenous or exogenous transcriptional regulators in the 
region of interest to achieve modulation of gene expression. Modulation of gene 
expression can be in the form of repression as, for example, when the target gene resides 
5 in a pathological infecting microorganism or in an endogenous gene of the subject, such 
as an oncogene or a viral receptor, that contributes to a disease state. Alternatively, 
modulation can be in the form of activation, if activation of a gene {e.g., a tumor 
suppressor gene) can ameUorate a disease state. For such applications, an exogenous 
molecule can be formulated with a pharmaceutically acceptable carrier, as is known to 

10 those of skill in the art. See, for example. Remington's Pharmaceutical Sciences^ 17*^ 
ed., 1985; and co-owned WO 00/42219. 

In one embodiment, a method involves the use of a fusion molecule comprising a 
DNA-binding domain and a component of a chromatin remodelmg complex, to modify 
chromatin structure in a region of interest, in combination with a second molecule having v, 

15 transcriptional regulatory activity which binds in the region of interest only after 

modification of chromatin structure in the region of interest. In certain embodiments, the 
second molecule comprises a fusion between a DNA-binding domain and eiliier a 
transcriptional activation domain or a transcriptional repression domain. Any 
polypeptide sequence or domain capable of influencing gene expression, which can be 

20 fused to a DNA-binding domain, is suitable for use. Activation and repression domains 
are known to those of skill in the art and are disclosed, for example, in co-owned WO 
00/41566. 

Exemplary activation domains include, but are not limited to, VP 16, VP64, p300, 
CBP, PCAF.SRCl PvALF, AtHD2A and ERF-2. See, for example, Robyr et al (2000) 

25 Mol Endocrinol 14:329-347; Collmg^oodL et al {1999) J. Mol Endocrinol. 23\255- 
275; Leo (2000) Ge/ze 245:1-11; Manteuffel-Cymborowska (1999)^c^a5zoc;izw. 
Pol 46:77-89; McKenna et al (1999) J. Steroid Biockem. Mol Biol 69:3-12; Malik et 
al (2000) Trends Biochem. Sci. 25:277-283; and Lemon a/. (1999) Curr. Opin. Genet, 
Dev. 9:499-504. Additional exemplary activation domains include, but are not limited 

30 to, OsGAI, HALF-1, CI, API, ARF-5, -6, -7, and -8, CPRFl, CPRF4, MYC-RP/GP, and 
TRABl. See, for example, Ogawa et al (2000) Gene 245:21-29; Okanami et al (1996) 
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Genes Cells 1:87-99; Goffetal (1991) Genes Dev. 5:298-309; Cho etal (1999) Pta 
Mol Biol 40:419-429; Ulmason et al (1999) Proc. Natl Acad Set USA 96:5844-5849; 
Spreuger-Haussels etal (2000) Plant J, 22:1-8; Gong etal (1999) Plant Mol Biol 
41:33-44; and Hobo etal (1999) Proa Natl Acad Set USA 96:15,348-15,353. 
5 Exemplary repression domains include, but are not limited to, KRAB, SID, 

MBD2, MBD3, members of the DNMT family (e,g., DNMTl, DNMT3A, DNMT3B), 
Rb, andMeCP2. See, for example, Bird etal (1999) Cell 99:451-454; Tyler et al 

(1999) C^// 99:443-446; Knoepfler a/. (1999) Ce// 99:447-450; and Robertson a/. 

(2000) Nature Genet. 25:338-342. Additional exemplary repression domains include, but 
10 are not limited to, ROM2 and AtHD2A. See, for example, Chem et al (1996) Plant Cell 

8:305-321; and Wn et al (2000) Plant J. 22119-27. 

It is likely that many transcriptional regulatory molecules, both endogenous and 
exogenous, are unable to interact with their target sites (and, hence, unable to exert their 
regulatory effects) when the target site is present in cellular chromatin. Without wishing 

15 to be bound by any particular theory, it is believed that chromatin modification in a 
region of interest can make such target sites accessible to their binding molecules. 
Accordingly, the methods and compositions disclosed herein complement methods of in 
vivo gene regulation using exogenous molecules, in those cases in which the target site 
for the exogenous molecule is not in an accessible region in cellular chromatin. Methods 

20 of gene regulation using exogenous molecules are disclosed, for example, in co-owned 
WO 00/41566. These include applications in regulation of plant gene expression, 
functional genomics and transgenic animals. 

Significant difficulties currently exist in therapeutic situations which require the 
reactivation of a developmentally-silenced gene. Developmentally-induced gene 

25 inactivation can be mediated by methylation of CpG islands in the upstream region of a 
gene. Thus, use of a binding domain specific for methylated DNA as the DNA-binding 
portion of a fusion can facilitate recruitment of a chromatin remodeling complex to the 
upstream region of a developmentally-silenced gene, making the gene accessible to 
exogenous regulatory factors, and resulting in therapeutic re-activation of the gene. In 

30 another embodiment, a fusion between a methylated DNA-binding domam and a 
demethylase can be used for reactivation of a gene silenced by methylation. 
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The compositions and methods disclosed herein are useful in a variety of 
applications and provide advantages over existing methods. These include therapeutic 
methods in which an exogenous molecule is administered to a subject and used to 
modulate expression of a target gene within the subject. See, for example, co-pending 
5 WO 00/41 566* The disclosed compositions and methods can also facilitate detection of 
particular sequences by binding of an exogenous molecule to a binding site in cellular 
chromatin as in, for example, diagnostic applications. Methods for detection of a target 
sequence using, for example, a ZFP are described in co-owned WO 00/42219. For 
example, an exogenous molecule, such as a sequence-specific DNA binding protein, can 

10 be used to detect variant alleles associated with a disease or with a particular phenotype 
in patient samples and to detect the presence of pathological microorganisms in clinical 
samples. In one embodiment, a variant allele comprises a single-nucleotide 
polymorphism (SNP). In a non-mutually exclusive embodiment, flie sequence-specific 
DNA binding protein is a ZFP. Exogenous molecules can also be used to quantify copy 

1 5 nimaber of a gene in a sample. For example, detection of the loss of one copy of a p53 
gene in a clinical sample is an indicator of susceptibiUty to cancer. Additionally, 
identification of transgenic plants and animals can be accomplished tlirough detection of 
a transgene using, for example, binding of a sequence-specific exogenous molecule (such 
as, for example, a ZFP) as an assay. All of these procedures can be enhanced by 

20 recruitment of a chromatin remodeling complex to a region of interest in cellular 
chromatin to faciUtate binding of a binding molecule in the region of interest. 

The disclosed methods and compositions, when used in conjunction with methods 
of binding of exogenous molecules to cellular chromatin, can be used in assays to 
determine gene function and to determine changes in phenotype resulting from specific 

25 modulation of gene expression. See, for example, co-owned U.S. Patent Application 
Serial No. 09/395,448, filed September 14, 1999. 

EXAMPLES 

The following examples are presented as illustrative of, but not limiting, the 
30 claimed subj ect matter. 
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Example 1 : Design, synthesis and binding properties of a zinc finger DNA- 
binding domain which recognizes a target sequence in the human VEGF gene 

Target site 

A ziiic finger DNA-bindLng domain, which recognizes the human vascular 
5 endothelial growth factor-A (VEGF) gene, was designed and constructed according to 
design rules and methods disclosed in co-owned WO 00/42219, WO 00/41566, and co- 
owned U.S. Patent Applications Serial Nos. 09/444,241 filed November 19, 1999, and 
09/535,088 filed March 23, 2000. The target site, which overlaps the transcription 
initiation site for the human VEGF-A gene, is shown below as SEQ ID NO: 1, with the 
10 arrow indicating the transcription startsite. 

□ 

5' -GGGGAGGAT-3' 

3 ' - CCCCTCCTA- 5 ' (SEQ ID NO: 1) 



15 Backbone structure 

The human SP-1 zinc finger transcription factor was used as backbone for the 
construction of a designed fhree-finger DNA binding domain, Vegl, capable of 
recognizing this sequence. SP-1 has a three finger DNA-binding domain related to the 
well-studied murine zinc finger protein Zif268. Christy et al (1988) Proc. Natl Acad, 

20 Set USA 85:7857-786L Site-directed mutagenesis experiments using this domain have 
shown that correlations between the amino acid sequence of a zinc finger and its target 
nucleotide sequence, derived fiom analyses of Zi£268, are also applicable to SP-1 and 
hence can be used to adapt the specificity of SP-1 to DNA sequences other than its 
normal target site. Desjarlais et al (1994) Proc. Natl Acad, Set USA 91 : 1 1099-1 1 1 03. 

25 The portion of tlie SP-1 sequence used for construction of designed zinc finger DNA 
binding domains corresponds to amino acids 533 to 624. 

Amino acid sequences of designed DNA-binding domains are illustrated in 
Table 1. As can be seen in the Table, the designed Vegl protein comprises three zinc 
fingers (Fl, F2 and F3) which together recognize a 9-base pair target site. The amino 

30 acid sequence of the recognition helix (positions -1 through+6, where +1 is the first 
amino acid in the a-helix) for each of the DNA-binding fingers is given. 



58 



CA 02407460 2002-10-24 
WO 01/83793 PCT/USOl/40616 



Table 1: Target sites and ZFP DNA-binding domains in the human VEGF-A gene 



Name 


Target site 


Location 


AA sequence 


Vegl 


5'-GGGGAGGAT-3' 


-8 to +1 


Fl 


TTSNLRR 


(SEQ ID NO: 3) 




(SEQ ID NO: 2) 




F2 


RSSNLQR 


(SEQ ID NO: 4) 








F3 


RSDHLSR 


(SEO ID NO: 5) 


VegSa 


5'-GCGGAGGCT-3' 


+3 to +11 


Fl 


QSSDLQR 


(SEQ ID NO: 7) 




(SEQ ID NO: 6) 




F2 


RSSNLQR 


(SEQ ID NO: 8) 








F3 


RSDELSR 


(SEQ ID NO: 9) 



Sequences encodins^ the Vesl DNA-biJidins domain 
5 A polymerase chain reaction (PCR)-based assembly procedure, using six 

overlapping oligonucleotides, was applied to the synthesis of a synthetic gene encoding 
tlie Vegl DNA-binding domain. See Figure 1. Three of the oligomxcleotides (1, 3, and 5 
in Figure 1) correspond to "universal" sequences that encode portions of the DNA- 
binding domain between the recognition helices. These ohgonucleotides are constant for 

1 0 any given zinc finger construct. The other three "specific" oligonucleotides (2, 4, and 6 
in Figure 1) were designed to encode the recognition helices. These oligonucleotides 
contained different sequences encoding amino acids at positions -1, +2, +3 and +6 in 
each recognition helix, depending on its target triplet sequence. Codon bias was chosen 
to allow expression in both mammalian cells and E, colL 

15 Assembly of Vegl coding sequences was carried out as follows. First, the six 

oligonucleotides (three universal and three specific, as described above) were combined 
and annealed at 25oC to form a gapped DNA scaffold. Next, gaps were filled by 
conducting a four-cycle PGR reaction (using Taq and Pfu thermostable DNA 
polymerases) to generate a double-stranded template. This template was amplified (for 

20 thirty cycles) using a pair of external primers containing Kpn I and Hind III restriction 
sites, PGR products were directly cloned into the Kpn I and Hind III sites of the Tac 
promoter vector, pMal-c2 (New England Biolabs, Beverly, MA). The Vegl zinc finger 
DNA-binding domain was expressed from this vector and purified as a fusion with the 
maltose binding protein according to the manufacturer's instructions (New England 

25 Biolabs, Beverly, MA). 
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Accuracy of the Vegl clone was verified by DNA sequencing. The Vegl 
nucleotide and amino acid sequences are as follows. 
Vegl nucleotide sequence: 
Kpnl 

5 GGTACCCATACCTGGCAAGAAGAAGCAGCACATCTGCCACATCCAGG 
GCTGTGGTAAAGTTTACGGCACAACCTCAAATCTGCGTCGTCACCTGCGCTGG 
CACACCGGCGAGAGGCCTTTCATGTGTACCTGGTCCTACTGTGGTAAACGCTT 
CACCCGTTCGTCAAACCTGCAGCGTCACAAGCGTACCCACACCGGTGAGAAG 
AAATTTGCTTGCCCGGAGTGTCCGAAGCGCTTCATGCGTAGTGACCACCTGTC 
10 CCGTCACATCAAGACCCACCAGAATAAGAAGGGTGGATCC (SEQ ID NO: 10) 

BaanHI 

Vegl amino acid sequence 

VPIPGKKKQHICfflQGCGKVYGTTSNLRRHLRWHTGERPFMCTWSYCGK 
RFTRSSNLQRHKRTHTGEK^a^ACPECPKI^^ 
15 (SEQ ID NO: 11) 

Expression ofVe^l 

Expression of designed ZFPs was carried out in two different systems. In the 
first, the DNA-binding peptides were expressed in E. coU by inserting them into tbe 

20 commercially available pETlSb vector (Novagen). This vector contains a T7 promoter 
sequence to drive expression of the recombinant protein. Constmcts were introduced into 
E. coll BL21/DE3 (laclq) cells, which contain an IPTG-inducible T7 RNA polymerase. 
Cultures were supplemented with 50 |liM ZnC12j were grown at 37^C to an OD at 600 nm 
of 0.5-0.6, and protein production was induced with IPTG for 2 lirs. These proteins are 

25 refen-ed to as "unfused" ZFPs. 

Partially pure unflised ZFPs were produced as follows (adapted from Desjarlais et 
aL (1992) Proteins: Structure, Function and Genetics 12:101-104). A frozen cell pellet 
was resuspended in l/50th volume of 1 M NaCl, 25 mM Tris-HCl (pH 8,0), 100 jaM 
ZnCl2p 5 mM DTT. Samples were boiled for 10 min and centrifuged for 10 mia at 

30 '-SjOOO X g. At this point, ZFP protein in the supernatant was >50% pure (as estimated by 
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staining of SDS-polyacrylamide gels with Coomassie blue), and the product migrated at 
the predicted molecular weight of around 1 1 kDa. 

The second method for producing ZFPs was to express them as fusions to the 
E, coli Maltose Binding Protein (MBP), N-terminal MBP fusions to ZFPs were 
5 constructed by PGR amplification of the pETISb clones and insertion into the vector 
pMal"C2 (New England Biolabs) under tlie control of the Tac promoter. The fusion 
allows simple purification and detection of recombinant protein. It had been reported 
previously that zinc finger DNA-binding proteins can be expressed from this vector in 
soluble foim to high levels in E. coli and can bind efficiently to the appropriate DNA 

10 target without refolding. Liu et al (1997) Proc, Natl Acad, ScL USA 94:5525-5530. 
Production of MBP-fused proteins was as described by the manufacturer (New England 
Biolabs, Beverly, MA). Transformants wei"e grown in LB medium supplemented with 
glucose and ampicillin, and were induced with IPTG for 3 hrs at 37°C. The cells were 
lysed by French press, then exposed to an agarose-based amylose resin, which 

1 5 specifically binds to the MBP moiety, thus acting as an affinity resin for the MBP fusion 
protein. The MBP fusion protein was eluted with 10 mM maltose to release ZFP of 
>50% purity. In some cases, protein was fltrther concentrated using a Centricon 30 filter 
unit (Amicon). 

20 Determination ofbindins: affinUv 

Partially purified ZFPs (both unfused and MBP fusions) were tested by 
electrophoretic mobility shift assay (EMSA) to assess their abiUty to bind to their target 
DNA sequences. Protein concentrations were measured by Bradford assay (BioRad). 
Since SDS-polyacrylamide gels demonstrated >50% homogeneity of ZFP produced by 

25 either purification method, no adjustment was made for ZFP purity in the calculations. 
For tMs reason, the data generated by EMSA (shown below) rqpresent an underestimate 
of the tme aJBBnity of the proteins for their targets (i.e., will be overestimated). In 
addition, inactive protein in the preparations could also contribute to an underestimate of 
the binding affinity of the active molecules in the preparation. Two separate preparations 

30 of protein were used for determination of Ka, to help control for differences in ZFP 
activity. 
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A 29-mer duplex oligonucleotide was used as a binding target for electrophoretic 
mobility shift analysis of Vegl. The sequence of the duplex (with VEGF sequences in 
bold and target site under/overlined) was as follows: 

5 5 ' - CATGCATAG CGGGGAGGAT CGCCATCGAT - 3 ' 

3 ' -GTACGTATCGCCCCTCCTAGCGGTAGCTA-5' 
(SEQ ID NO: 12) 

The top strand was labeled, prior to annealing, with polynucleotide kinase and 

10 y-'^^P ATP. Top and bottom strands were annealed in a reaction containing each 

oligonucleotide at 0.5 pM, 10 mM Tris-HCI (pH 8.0), 1 mM EDTA, and 50 roM NaCl. 
The mix was heated to 95°C for 5 min and slow-cooled to 30*^0 over 60 min. Duplex 
formation was confirmed by polyacrylamide gel electrophoresis. Free label and single 
stranded DNA remaining in the target preparations did not appear to interfere with the 

1 5 binding reactions* 

Assays for binding of Vegl to the target oligonucleotide (above) were performed 
by titrating protein against a fixed amoimt of duplex target. Binding reactions contained 
50 pM 5' ^^P labeled double stranded target DNA, 10 mM Tris-HCI (pH 7.5), 
100 mM KCl, 1 mM MgCb, 1 mM dilhiotbreitol, 10% glycerol, 200 fig/nil bovine serum 

20 albumin, 0.02% NP-40, 20 ng/ml poly dl-dC (optionally), and 100 jiM ZnClj, in a final 
volume of 20 ^il. Protein was added to the binding reaction as one-fifth volume from a 
dilution series made in 200 mM NaCl, 20 mM Tris (pH 7.5), 1 mM DTT. Binding was 
allowed to proceed for 45 min at room temperature. Polyacrylamide gel electrophoresis 
was carried out at room temperature using precast 10% or 10-20% Tris-HCI gels 

25 (BioRad, Hercules, CA) and Tris-Glycine running buffer (25 mM Tris-HCI, 192 mM 
glycine, pH 8.3) containing 0.1 mM ZnCl2. Radioactive signals were quantitated with a 
Phosphorimager. 

Figure 2 shows the results of EMS A analysis of Vegl, using a four-fold dilution 
series of the Vegl protein. Shifted product, indicative of labeled target with bound 
30 protein, is indicated by an arrow in Figure 2A. The amount of shifted product was 
determined at each protein concentration and quantitated on a Phosphorimager 
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(Molecular Dynamics). The relative signal (percent of maximal amount of shifted 
product) was plotted as a function of logio protein concentration. In this case, the protein 
concentration yielding half-maximal bmding of Vegl to its target site (i.e., the apparent 
Kd) was approximately 50 nM. MBP-fiised and unfiised versions of Vegl bound to the 
5 target site with similar affinities. 

Example 2: Construction of a gene encoding a fusion between the Vegl DNA 
binding domain and hBAF 155 

The Vegl DNA binding domain is subcloned into a eukaryotic expression vector, 

10 in such a way that it is fiised to the hBAF155 subunit of the brm/BRG cliromatin 

remodeling complex. First, a cDNA sequence encoding a full length BAF155 protein is 
cloned using long range PGR, Barnes (1994) Proc. Natl Acad. Scl USA 91:2216-2220; 
Cheng et al (1994) Proc. Natl Acad. Set USA 91:5695-5699. Reagents and enzymes for 
performing long-range PGR are available from Roche Molecular Biochemicals 

1 5 (Indianapolis, IN) under the name "Expand PGR System." The oligonucleotide primers 
are homologous to sequences just upstream of tlie translation initiation codon at 
nucleotide 55 and just downstream of the final codon (proline at nucleotide 3366). (The 
BAFl 55 numbering scheme refers to the Genbank Accession number U66615.) In 
addition, the primer upstream of nucleotide 55 contains a BamHI site positioned such 

20 that, when upstream sequences encoding the Vegl DNA-binding domain are flised to 

BAFl 55 sequences, the translational reading firame is preserved. Furthermore, the primer 
downstream of nucleotide 3366 contains a Hindlll site just downstream of the final codon 
of BAF155 positioned such that, if BAF155 sequences are fused to downstream 
sequences encoding a FLAG epitope tag, the translational reading firame is preserved. 

25 PGR is performed using cDNA from HeLa cells as template. Amplified product 

having a size of approximately 3400 base pairs is gel-purified and cloned directly into a 
Topo2 cloning vector (Invitrogen, Carlsbad, CA). Site-directed mutagenesis is used to 
ehminate the BamHI site at BAFl 55 position 2304, without altering the coding capacity 
or translational reading firame of the gene. A similar approach is used to eliminate the 

30 Kpnl sites at nucleotides 2235 and 3243, and the HindHI sites at nucleotides 656 and 
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2365. The cloned aad modified BAF155 gene is then removed from the Topo2 vector by 
digestion with BamHI and Hindin, and gel-purified. 

The expression vector is modified from pcDNA3. l(-) (Invitrogen, Carlsbad, CA), 
by digesting it with EcoRI and Hindlll, and inserting a double-stranded oligonucleotide 
5 encoding an EcoRI site, a translation initiation sequence (Kozak (1991) J. BioL Chem. 
266:19,867-19,870), a nuclear locaUzation signal (NLS), aKpnl site, a BatnHI site and a 
Hindlll site. The NLS is derived from the SV40 large T-antigen (Kalderon et al (1984) 
Cell 39:499-509), and has the amino acid sequence MAPKKECRKVGIHGV (SEQ ID 
NO: 13). 

10 This plasmid is then digested with BamHI and Hindlll, and the BamHI-Hindlll 

fragment comprising the BAF155 gene {supra) is inserted. A double-stranded 
oligonucleotide encoding a FLAG epitope (having the sequence D YKDDDDK, SEQ ID 
NO: 14), and containing Hindin sites at both ends is inserted concurrently with the 
BAF 155 -containing fragment. Alternatively, the FLAG-contaioing HindUI fragment can 

15 be inserted in a separate, subsequent ligation. The resulting construct comprises, in 

order, CMV immediate early promoter, EcoRI site, translation initiation sequence, SV40 
large T-antigen nuclear locahzation sequence, Kpnl site, BamHI site, liBAF155 coding 
sequence, Hindin site, FLAG epitope, Hindlll site, bovine growth homione (bGH) 
polyademylation signal, in apcDNAS.l (Invitrogen, Carlsbad, CA) plasmid backbone. 

20 The CMV promoter and bGH polyadenylation signal are derived from tlie original 
pcDNA3.1 vector, as are sequences for replication and selection. 

Next, the Vegl ZFP DNA-binding domain (see Example 1) is inserted, as a Kpnl- 
BamHI fragment, into the vector described in the preceding paragraph to generate a 
vector encoding a protein having the structure (from N- to C-temiinus): Nuclear 

25 localization sequence - Vegl DNA binding domain - hBAF155 - FLAG epitope tag. The 
integrity of these constructs, and the preservation of the reading frame, is confirmed at 
each step of the procedure by nucleotide sequence analysis. Upon transfection into 
mammaJian cells this vector produces a NLS-Vegl-BAF155-FLAG fusion, whose 
transcription is controlled by a CMV immediate early promoter and a bovine growth 

30 hormone polyadenylation signal. 
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Similar procedures are used to constmct a plasmid encoding a fusion of a DNA- 
biiiding domain with any component of a chromatin remodeling complex. In brief, a 
polynucleotide encoding a component of a chromatin remodeling complex (or a 
functional fiagment thereof) is obtained by PGR from cDNA (or optionally genomic 
5 DNA) using primers containing flanking BamHI and Hindm sites. BamHT, Kpnl and 
HindUE sites, if present in the amplified product, are removed by site-directed 
mutagenesis, preserving the reading frame and coding capacity in the process. The 
amplified gene is introduced into a BamHI/Hindlll-digested expression vector 
constructed as described above, optionally along with a Hindlll fragment containing a 

10 FLAG epitope. The resulting construct is digested with Kpnl and BamHI and a 
KpnI/BamHI fragment, encoding a DNA-binding domain, preferably a ZFP DNA- 
binding domain, is inserted. Sequences encoding nuclear localization sequences and 
FLAG epitopes, for immunological detection of the fusion protein, are optionally 
included in the construct. 

15 Plasmids encoding these fusions are propagated in any suitable host strain, 

preferably E. coli strains JM109 or HBIOI. 

Example 3: Design and synthesis of a six-finger ZFP DNA-binding domain 
wliich recognizes target sequences in the human VEGF gene 

20 Tars^et site 

A zinc finger DNA-binding domain, wliich recognizes the human vascular 
endothelial gro^vth factor-A (VEGF) gene, was designed and constructed according to 
design rules and methods disclosed in co-o^A/ned WO 00/42219, WO 00/41 566, and co- 
owned U.S. Patent Applications Serial Nos. 09/444,241 filed November 19, 1999 and 
25 09/535,088 filed March 23, 2000. The target site, which overlaps the transcription 

initiation site for the human VEGF-A gene, is sliown below as SEQ ID NO: 15, with the 
arrow indicating the transcription startsite. 

□ 

5 ' -GGGGAGGATCGCGGAGGCT-3 ' 
30 3' -CCCCTCCTAGGGCCTCCGA-5' {SEQ ID NO: 15) 
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Backbone structure 

Amino acids 533-624 of the human SP-1 zinc finger transcription factor were 
used as backbone for the construction of a designed six-finger DNA binding domain, 
Veg3a/1, capable of recognizing this sequence. 
5 Amino acid sequences of the designed DNA-binding domains are illustrated in 

Table 1, As can be seen in the Table, the designed Veg3a/1 protein comprises two 
subdomains, Vegl and Veg3a, each comprising three zinc fingers (Fl, F2 and F3) and 
each recognizing a 9-base pair subsite of the target site, joined by the linker sequence 
DGGGS (SEQ ID NO: 16). The amino acid sequence of the recognition helix (positions 
10 -1 through+65 where +1 is the first amino acid in the a-helix) for each of the DNA- 
bmding fingers is given in Table 1. 



Sequences encoding Vegl and VegSa subdomains 

Synthesis of the Vegl bmding domain was described in Example 1. Assembly of 
15 the Veg3a coding sequences was carried out as described above for Vegl (Example 1 and 
Figure 1) except tliat different specific oligonucleotides were used to encode the Veg3a 
recognition helices. 

Accuracy of the Veg3a clone was verified by DNA sequencing. The Veg3a 
nucleotide and amino acid sequences are as follows. 
20 Veg3a nucleotide sequence: 

Kpnl 

GGTACCCATACCTGGCAAGAAGAAGCAGCACATCTGCCACATCCAGG 
GCTGTGGTAAAGTTTACGGCCAGTCCTCCGACCTGCAGCGTCACCTGCGCTG 
GCACACCGGCGAGAGGCCTTTCATGTGTACCTGGTCCTACTGTGGTAAACGCT 
25 TCACCCGTTCGTCAAACCTACAGAGGCACAAGCGTACACACACCGGTGAGAA 
GAAATTTGCTTGCCCGGAGTGTCCGAAGCGCTTCATGCGAAGTGACGAGCTG 
TCACGACATATCAAGACCCACCAGAACAAGAAGGGTGGATCC 
(SEQ ID NO: 17) BamHI 
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Veg3a amino acid sequence: 

WIPGKKXQfflCmOGCGKVY GOSSDLQK EILRWHTGBRPFMCTWSVnG^ 
RF TRSSNLORH KRTHTGEKKFACPECPKKE MRSDELSR^ 
(SEQ ID NO: 18). 

5 The recognition regions of the VegSa polypeptide (amino acids -1 through +6 of 

the zinc finger recognition helices) are shown in bold rniderline. 

Determination ofVegSa binding affyiitv 

The puriJQed Veg3a zinc finger DNA-binding domain is tested for affinity to its 
10 DNA target site by electrophoretic mobility shift analysis. A double-stranded target 

oligonucleotide was constructed by annealing complementary 29-mers, then end-labeled 
using polynucleotide kinase and y-^^P-ATP. The sequence of the target (with VEGF 
sequences in bold and target sites under/overlined) was as follows: 

15 5 ' - CATGCATATCGCGGAGGCTTGGCATCGAT ~ 3 ' 

3 ' -GTACGTATAGCGCCTCCGAACCGTAGCTA-5 ' 
(SEQ ID NO: 19) 

20 Fusion o fVesl and Ves3a subdomains to form a Veg3a/1 bindins domain 

Vegl and Veg3a binding subdomains were joined to each other^ using the linker 
sequence DGGGS (SEQ ID NO: 16). This particular Unker sequence was chosen 
because it pemiits binding of two three-finger ZFP binding subdomains to two 9"bp sites 
that are separated by a one nucleotide gap, as is the case for the Vegl and Veg3a sites. 
25 See also Liu et al., supra. 

The 6-finger Veg3a/1 protein encoding sequence was generated as follows. 
Sequences encoding VegSa recognition helices were PCR-amplified firom the VegSa- 
encoding vector {supra) using the primers SPE7 

(5'-GAGCAGAATTCGGCAAGAAGAAGCAGCAC) (SEQ ID NO: 20) and 
30 SPEampl2 (5 '-GTGGTCTAGACAGCTCGTCACTTCGC) (SEQ ID NO: 21) to 
generate a double-sti-anded jfragment bounded by EcoRI and Xbal restiiction sites 
(underlined in the sequences of the primers). The amplification product was digested 
with EcoRI and Xbal. Sequences encoding Vegl recognition helices were PCR- 
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amplified from the Vegl-encoding vector (Example 1) using the primers SPEamplS 
(5'-GGAGCCAAGGCTGTGGTAAAGTTTACGG) (SEQ ID NO: 22) and SPEampl 1 
(5'-GGAGAAGCTTGGATCCTCATTATCCC) (SEQ ID NO: 23) to generate a double- 
stranded amplification product bounded by Sfyl and HindlH restriction sites (underlined 
5 in the sequences of the primers). The resulting amplification product was digested with 
Styl and Hindlll. A third double-stranded fragment was constructed, using synthetic 
oligonucleotides, which encodes the DGGGS Imkety flanked by the remainders of the 
Vegl and Veg3a DNA-binding domains, and bounded by Xbal and Styl sites. The 
sequence of this third fragment is as follows, with the Xbal and Styl sites underlined: 
10 Xbal 

3 ' TGTGTAGTTTTGGQTGGTCTTGTTCTTTCTGCCGCCACCG 

AGCGGCAAAAAGAAACAGCACATATGTCACATC 3' SEQ ID NO: 24 

15 TCGCCGTTTTTCTTTGTCGTGTATACAGTGT AGGTTC 5' SEQ ID NO: 25 

Styl 

These thi^ee fragments were ligated to one another, the ligation product was 
amplified with primers SPE7 and SPEampl 1, and the resulting amplification product was 
20 digested with EcoRI and Hindm and cloned into the EcoRI and Hindlll sites of pUCl 9. 

The linked VegSa- and Vegl-encoding sequence, in the pUC19 backbone, was 
then amplified with the primers GB19 

(5'-GCCATGCCGGTACCCATACCTGGCAAGAAGAAGCAGCAC) (SEQ ID 
NO: 26)andGB10 

25 (5'-CAGATCGGATCCACCCTTCTTATTCTGGTGGGT (SEQ ID NO: 27) to 

introdxice Kpnl and BamHI sites at the ends of the amplification products (restriction 
sites underlined). The amplification products were then digested with Kpnl and BamHI, 
and cloned into the modified pMAL-c2 expression vector described above. 

The nucleotide sequence encoding the designed 6-finger ZFP VegSa/l, from the 

30 Kpiol site to the BamHI site is: 

GGTACCCATACCTGGCAAGAAGAAGCAGCACATCTGCCACATCCAGGGCTGT 
GGTAAAGTTTACGGCCAGTCCTCCGACCTGCAGCGTCACCTGCGCTGGCACA 
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CCGGCGAGAGGCCTTTCATGTGTACCTGGTCCTACTGTGGTAAACGCTTCACA 
CGTTCGTCAAACCTACAGAGGCACAAGCGTACACACACAGGTGAGAAGAAA 
TTTGCTTGCCCGGAGTGTCCGAAGCGCTTCATGCGAAGTGACGAGCTGTCTAG 
ACACATCAAAACCCACCAGAA.CAAGAAAGACGGCGGTGGCAGCGGCAAAAA 
5 GAAACAGCACATATGTCACATCCAAGGCTGTGGTAAAGTTTACGGCACAACC 
TCAAATCTGCGTCGTCACCTGCGCTGGCACACCGGCGAGAGGCCTTTCATGT 
GTACCTGGTCCTACTGTGGTAAACGCTTCACCCGTTCGTCAAACCTGCAGCGT 
. CACAAGCGTACCCACACCGGTGAGAAGAAATTTGCTTGCCCGGAGTGTCCGA 
AGCGCTTCATGCGTAGTGACCACCTGTCCCGTCACATCAAGACCCACCAGAA 

10 TAAGAAGGGTGGATCC (SEQ ID NO: 28) 

The VEGF3a/l amino acid sequence (using single letter code) is: 
VPIPGKKKQfflCHIQGCGKVYGQSSDLQRHLRWHTGBRPFMCTWSYCGKR^ 
SNLQRHKRTHTGEKKFACPECPKSLFMRSDELSRHIKTHQ^^^ 
CmQGCGKWGTTSNLIOUII^WHTGERPFMCTW 

15 TGEKKFACPECPKRFMRSDHLSRHnCTHQNKKC^^ (SEQ ID NO: 29) 

The Veg3a/1 protein was expressed in E. coli as an MBP fusion, purified by 
affinity chromatography, and tested in EMSA experiments as described supra, A labeled 
double-stranded oligonucleotide comprising the target site was prepared by synthesis and 
annealing of two overlapping oligonucleotides, one of which was labeled with ^^P. The 

20 oligonucleotides comprised the following sequences (with the target site 
over/underlined) : 

AGCQAGC GGGGAGGATCGCGGAGGCT TGGGGCAGCCGGGTAG (SEQ ID WO: 3 0) 

TCGCCCCTCCTAGCGCCTCCGAACCCCGTCGGCCCATCTGGC (SEQ ID NO: 31) 

25 

Binding analysis was conducted as described in Example 1 for the Vegl protein< 
Binding was allowed to proceed for 60 min at either room temperature or 37^C, and 
polyacrylamide gel electrophoresis was carried out at room temperature or 37°C using 
precast 10% or 10-20% Tris-HCl gels (BioRad) and standard Tris-Glycine running 
30 buffer. The room temperature assays yielded an apparent (detemiined as described 
supra) for this Veg3a/1 protein of approximately 1 .5 nM, When binding and 
electrophoresis were performed at 37^*0, the apparent Kd of VegSa/l was approximately 9 
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hM when tested against the 18-bp target. Thus, the six finger Veg3a/1 ZFP bound with 
high affinity to its target site. 

Example 4: Construction of a gene encoding a fusion between a ZFP DNA 
5 binding domain and a methyl-binding domain protein 

A plasmid encoding a fusion between the human MBDl gene and the Veg3a/1 
DNA-binding domain is constmcted using methods similar to those described above for 
the BAF155A^ egl fusion (Example 2). Sequences encoding MBDl (GenBank accession 
No. NM015846) are isolated by PGR from genomic DNA or cDNA. Amplification 

10 primers are designed such that the primer corresponding to the upstream region of the 
gene comprises a BamHI Site at or near its upstream terminus, aiid the primer 
corresponding to the downstream region of the gene comprises a HindHI site at or near its 
downstream terminus. The primers are designed to amplify the region between 
nucleotides 140 (MBDl initiation codon) and 1,957 (MBDl termination codon), and to 

15 retain the correct reading fi-ame of the MBDl gene when the amplification product is 
incorporated as a component of a fusion gene. The amplification product is optionally 
clonedj a BamHI site at nucleotide 264 of the MBDl sequence is removed by site- 
specific mutagenesis, and the BamHI/HindHI fragment is released from the cloning 
vector and purified. Sequences encoding the Veg3a/1 DNA-binding domain are obtained 

20 as a Kpnl/BamHI fragment (Example 3). The MBDl-encoding BamHI/Hindlll fragment 
and the Veg3 a/1 -coding KpnI/BamHI fragment are inserted into pcDNA3.1(-) or a 
modified derivative (Example 2). A nuclear locaUzation signal and/or a FLAG epitope 
are optionally included in the fusion construct. 

The MBDl gene can be divided into at least two functional fragments: a 

25 methylated DNA binding domain (encoded by nucleotides 158-322) and a functional 
domain. Accordingly, a MBD/ZFP fusion gene is constructed that lacks sequences 
encoding the methylated DNA-binding domain, but contains the functional domain of the 
MBDl protein. In this case-, the BamHI/Hindni-tenninated ampUfication product 
comprises nucleotides 322 through 1,957 of the MBDl gene. 

30 A similar fusion gene is constmcted, in which the MBD2 gene (GenBank 

accession No. NM003927), or a functional fragment thereof, is fused to a ZFP DNA- 
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binding domain. la this case, the amplification primers are designed to amplify the 
region between nucleotides 230 (MBD2 initiation codon) and 1,465 (MBD2 termination 
codon), and to retain the correct reading frame of the MBD2 gene when the amplification 
product is incorporated as a component of a fusion gene. The amplification product is 
5 optionally cloned, a Kpnl site at nucleotide 813 and a HindlU site at nucleotidel308 of 
the MBDl sequence are removed by site-specific mutagenesis, and the BamHI/Hindlll 
fragment is released from the cloning vector and purified. Sequences encoding the 
Veg3a/1 DNA-binding domain are obtained as a Kpnl/BamHI fragment (Example 3). 
The MBD2-encoding BamHI/Hindlll fragment and the Veg3a/l-coding KpnI/BamHI 

10 fragment are inserted into pcDNA3. l(-) or a modified derivative (Example 2). A nuclear 
localization signal and/or a FLAG epitope are optionally included in the ftision construct. 

The methylated DNA-binding domain of the MBD2 gene is encoded by 
nucleotides 680-862. Accordingly, a MBD/ZFP fusion gene is constructed that lacks 
sequences encoding the methylated DNA-binding domain, but contains the fimctional 

15 domain of the MBD2 protein, by designing the amplification primers to amphfy the 
region of the MBD2 gene located between nucleotides 862 and 1,465. As in previous 
examples, the amplification primers comprise BamHI and HindUI sites at or near their 
termini, to maintain the MBD2 reading frame and facilitate construction of the ftision 
protein by the methods described supra. In this case, the Hindin site at nucleotide 1,308 

20 is removed subsequent to amplification and prior to constmction of the fiision nucleic 
acid. 



Example 5: Introduction of fusion molecules into cells 

Human embryonic kidney cells (HEK 293) are grown in DMEM (Dulbecco's 
25 modified Eagle medium) supplemented with 10% fetal calf serum. Cells are plated in 
10 cm dishes at a density of 2.5 x 10^ per plate and grown for 24 hours m a CO2 
incubator at 3TC, For transfection, 10 jxg of plasmid DNA is diluted in 2.5 ml Opti- 
MEM (Life Technologies), and 50 \il of Lipofectamine 2000 is diluted in 2,5 ml Opti- 
MEM. The diluted DNA and lipid are mixed and incubated for 20 minutes at room 
30 temperature. Medium is then removed from the cells and replaced with the lipid/DNA 
mixture. Cells are incubated at 3TC for 3 hours in a CO2 incubator, then 10 ml of 
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DMEM+10% FBS is added. Cells are harvested 40 hours after transfectioxi for analysis 
of chromatm structure (Example 6) and gene expression (Example 7). 

Intercalator-protein fusions, MGB-protein fusions and/or TFO-protein fusions are 
introduced into cells after encapsulation into liposomes, using standard procedures that 
5 are well-known in the art. 

Example 6: Assays for chromatin remodeling 

Recruitment of a chromatin remodeling complex to a region of interest in cellular 
10 chromatin, by a fusion molecule comprising a DNA-binding domain and a component of 
a chromatin remodeling complex, is evidenced by alteration of chromatin structure in the 
region of interest. Alteration of chromatin structure mediated by the Vegl-BAF155 
fusion molecule described supra (Example 2) is assessed by rnvestigatiag nuclease 
hypersensitive sites in the vicinity of the Vegl binding site, as described in this example. 

15 

Cell Growth and isolation of nuclei for studies of nuclease hypersensitivity 
Transformed himian embryonic kidney 293 cells are grown in DMEM + 10% 
fetal calf serum, supplemented with penicillin and streptomycin, in a 3TC incubator at 
5% CO2. Typically, two 255 cm" plates of cells are used in an experiment. When the 

20 cells reach greater than 90% confluence (-2.5 x 10^ cells per plate), medium is removed 
and the cells are rinsed twice with 5 ml of ice-cold PBS (Gibco/Life Technologies, 
Gaithersburg, MD). Cells are then scraped from the plates in 5 ml of ice-cold PBS and 
combined in a 50 ml conical centrijEuge tube. The plates are washed with 10 ml of ice- 
cold PBS and the washes ai'e added to the tube. Nuclei are pelleted by centrifiigation 

25 (1400 rpm for 5 min) and the supernatant is removed. The pellet is mixed by vortexing 
and, while vortexing, 20 ml of lysis buffer (10 mM Tris pH 7.5, L5 mM MgCl2, 10 mM 
KCl, 0,5% IGBPAL CA-630 (Sigma), 1 mM phenylmethylsulfonyl fluoride, 1 mM 
dithiothreitol) is added. The cell pellet is resuspended in lysis buffer by pipettmg and the 
tube is centrifuged at 1400 rpm for 5 min. The supernatant is removed and the pellet is 

30 resuspended in 20 ml of lysis buffer and centrifuged as before. The final pellet is 

resuspended in 1.5 ml dilution buffer (15 mM Tris pH 7.5, 60 mM KCl, 15 mM NaCl, 
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5 mM MgPh, 0. 1 mM dithiofhreitol, 10% glycerol), nuclei are counted in a microscope 
and the solution is adjusted so that a concentration of approximately 10^ nuclei per ml is 
obtained. 

5 DNase treatment of nuclei 

Nuclei, at a concentration of 10^ per ml in dilution buffer, are digested with 
different concentrations of DNase 1. DNase I dilutions are prepared by diluting 
deoxyribonuclease I (Worthington, Freehold, NJ) in dilution buffer (stiprd), optionally 
supplemented with 0,4 mM CaCl2. To 100 \il of resuspended nuclei is added 25 |li1 of a 

10 DNase I dilution to give final DNase I concentrations ranging from 0.07 Units/ml to 

486 Units/ml in three-fold concentration increments. Digestions are conducted at room 
temperature for 5 min. Digestion reactions are then stopped by addition of 125 ^1 of 
Buffer AL (Qiagen DNeasy™ Tissue Kit) and 12.5 fxl of a 20 mg/ml solution of 
Proteinase K (Qiagen DNeasy™ Tissue Kit), followed by incubation at 70°C for 10 min. 

15 Digested DNA is purilBed using the DNeasy™ Tissue Kit (Qiagen, Valencia, CA) 
according to the manufacturer's instructions. 

Purified DNase-treated DNA is digested with restriction enzyme at 37^C 
overnight with 40 Units of restriction enzyme in the presence of 0.4 mg/ml RNase A- 
After digestion, DNA is ethanol-precipitated from 0.3 M sodium acetate. 

20 Micrococcal nuclease can be used as an altemative to DNase I for examination of 

chromatin structure. Treatment of nuclei, obtained as described supra^ with micrococcal 
nuclease is conducted as described by Livingstone-Zatchej et aL m Methods in Molecular 
Biology, Vol. 119, Humana Press, Totowa, NJ, pp. 363-378. 

25 Treatment of nuclei with a chemical probe 

Nuclei are treated with MPE using the following procedure adapted from 
Cartwright et aL, supra. A freshly-diluted stock of 0.4 M H2O2 is prepared by making a 
25-fold dilution of a 30% stock solution. A freshly-prepared stock of 0.5 M ferrous 
ammonium sulfate is diluted 400-fold in water, A solution of methidiumpropyl EDTA 

30 (MPE) is prepared by adding 30 fxl of 5 mM MPE to 90ili1 of water. To this MPE 
solution is added 120 |al of tlie ferrous ammonium sulfate dilution and 2.5 |uil of 1 M 
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dithiotbreitol (DTT, freshly prepared from powder). To a suspension of nuclei, obtained 
as described supra^ are added, in sequence: 3.5 p,l of 0.4 M H2O2 and 37.5 \x\ of the 
MPE/ferrous ammonium sulfate/DTT mixture. The reaction is terminated after an 
appropriate time period (determined empirically) by addition of 40 |a.l of 50 mM 
5 bathophenanthroline disulfonate, 0.1 ml of 2.5% sodium dodecyl sulfate/50 mM 
EDTA/50 mM Tris-Cl, pH 7.5 and 10 \x\ of Proteinase K (10-14 mg/ml). Proteinase 
digestion is conducted at 'iTC for at least 8 hours and the mixture is then extracted twice 
with phenol/chloroform and once with chloroform. Nucleic acids are precipitated from 
the aqueous phase by addition of sodium acetate to 0.3 M and 0.7 volume of isopropyl 
10 alcohol, incubation on ice for at least 2 hr, and centrifugation. The pellet is washed with 
70% ethanoU dried, resuspended in 10 mM Tris-Cl, pH 8 and treated withRNase A 
(approximately 0.1 mg/ml) for 15 min at 37^C. 

Blotting and hybridization 

1 5 Pellets of precipitated, digested DNA, obtained after treatment with enzymatic or 

chemical probes as described supra^ are resuspended in 22 ^il of loading buffer 
containing glycerol and tracking dyes ("Gel loading solution/* Sigma Chemical Corp., St. 
Louis, MO) and incubated at 55^C for 3-4 hours. Twenty microliters of resuspended 
sample is loaded onto a 1% agarose gel containing IX TAE buffer and 0.5 |ag/ml 

20 ethidium bromide, and electrophoresis is conducted at 22 Volts for 16 hours in Tris- 
acetate-EDTA buffer. After electrophoresis, the gel is treated with alkali, neutralized, 
blotted onto aNytran membrane (Schleicher & Schuell, Keene, NH), and the blotted 
DIS A is crosslinked to the membrane by ultraviolet irradiation. 

Probes are labeled by random priming, using the Prime-It Random Primer 

25 Labeling Kit (Stratagene, La Jolla, CA) according to the manufacturer's instmctions. In a 
typical labeling reaction, 25-50 ng of DNA template is used in a fmal volume of SO p-L A 
specific activity of 10^ cpm/|Lig is typically obtained. Labeled probes are purified on a 
NucTrap probe column (Stratagene #400702, La JoUa, CA). 

The membrane is placed in a hybridization bottle and pre-hybridized in Rapid 

30 Hybridization Buffer (Amersham, Arlington Heights, IL) at 65°C for 15 min. Probe (a 
0.1 kb Xbal-Kpnl fragment, see Figure lA) is added (approximately 0.03 |xg containing 
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approximately 3.3 x 10^ cpm) and hybridization is conducted at 65°C for 2 hours. 
Following hybridization, the membrane is washed once at 65°C for 10 min. with 2X SSC 
+ 0,1% SDS, and twice at 65°C for 10 min. with O.IX SSC + 0.1% SDS. The membrane 
is then dried and analyzed either by autoradiography or with a phosphorimager. 
5 Results are shown in Figure 3 for analysis of DNase hypersensitivity, in HEK293 

cells, within an approximately 1,000 base-pair region upstream of the hxraian VEGF-A 
gene transcriptional startsite. Increasing DNase concentration resulted in the generation 
of two new sets of DNA fragment doublets, centered at approximately 500 and 1,000 
nucleotides, indicating the presence of two DNase hypersensitive regions. One of these 
10 regions is centered approximately 500 base pairs upstream of the transcriptional startsite; 
the other is centered on the transcriptional startsite. 

RemodeUng of VEGF chromatin can involve, among other things, loss of one or 
both of these hypersensitive regions, or the generation of one or more additional 
hypersensitive regions, either upstream or downstream of the transcriptional startsite. 

15 

Example 7: Assays for modulation of gene expression 

General Activation or repression of transcription resulting from localized 
chromatin remodeling is determined by measurement of RNA and/or protein gene 
products. These methods are well-known to those of skill in the art. 
20 For example, Mizuguchi et a/ (1 999) in Methods in Molecular Biology, Vol. 119, 

"Chromatin Protocols" (P.B. Becker, ed.) Humana Press, Totowa, 1999, pp. 333-342 
describe a procedure for in vitro transcription of chromatin that has been remodeled by 
the Drosophila NURF complex. This assay can be used to detect changes in 
transcriptional properties of chromatin (either activation or repression) following 
25 chromatin remodeling. 

Production and specificity of RNA can also be measured by RNA blots, nuclease 
protection and/or quantitative real-time PGR (colloquially known as the **Taqman" 
assay), as is known to those of skill in the art. See, for example, Ausubel et aL, supra. 
Protein production can be measured by immunoassay {e.g., ELISA, 
30 immunoprecipitation), gel electrophoresis, and/or inamunological detection of protein 
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blots ("Western" blots), as is known to those of skill in tlie art. See, for example, 
Ausubel et aL^ supra. 

Reporter genes, either chromosomal or extrachromosomal, can also be used to 
assay activation and/or repression of specific promoters. Accordingly, effect of 
5 chromatin remodeling on a promoter that is operatively linked to a reporter gene (such as, 
for example, alkaline phosphatase, p-galactosidase, p-glucuronidase, chloramphenicol 
acetyl transferase, horseradish peroxidase, luciferase, or green fluorescent protein) can be 
assayed by measuring the levels and/or activity of the reporter. Methods for fusion of a 
promoter to a reporter gene, and methods for assay of reporter gene products, are known 

10 to those of skill in the art. See, for example, Ausubel et aL, supra. 

RNA analysis. Transient transfection of HEK293 cells, seeded in 6-well plates, is 
carried out as described in Example 6, supra. Cell lysates are harvested 40 hours post- 
transfection. To assay the activation of the endogenous chromosomal VEGF gene, RNA 
blotting ("Northern" blotting) is used to measure VEGF mRNA levels. Briefly, PolyA+ 

1 5 RNA is isolated from HEK 293 cells transfected with a fusion plasmid or firom mock- 
transfected HEK293 cells, using the Oligotex kit (Qiagen^ Valencia, CA), according to 
the manufacturer's instractions. The fusion plasmid encodes a fusion protein comprising 
a nuclear locahzation sequence, the Vegl DNA-binding domain, BAF155 and a FLAG 
epitope (see Example 2, supra). 7p,g of RNA are resolved on a 2.4% agarose gel 

20 containing 2.4 M formaldehyde, and the gel is blotted onto Nytraii SuPerCharge 
membrane (Schliecher & Schuell, Keene, NH) using 20X SSC. The membrane is 
hybridized at 65°C for 1 hour in Rapid-Hyb Buffer (Amersham-Phannacia Biotech, 
Piscataway, NJ) containing a ^^P-labeled VEGF cDNA probe. The VEGF cDNA 
construct is generated by inserting a human VEGF cDNA fragment, obtained by PGR 

25 amplification, into tlie pCDNAS.l vector (Invitrogen, Carlsbad CA) at the Xbal and 

EcoRI sites. Stmcture of the clone is confirmed by sequencing. After hybridization, the 
VEGF probe is stripped from the membrane, and the blot is re-hybridized with a ^^P- 
labeled GAPDH DNA probe. VEGF mRNA levels, as determined by RNA blotting, are 
normalized to GAPDH mRNA levels. 

30 For real-time quantitative PGR ('Taqman") analysis of mRNA abundance, total 

cellular RNA from transfected HEK 293 cells is isolated using the Rneasy Kit (Qiagen, 
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Valencia, CA). RNA samples (25 ng) are mixed with 0.3 |j,M of each primer, 0.1 iiiM of 
probe, 5,5 mM MgCl2, 0.3 mM of each dNTP, 0.625 unit of AmpliTaq Gold RNA 
Polymerase, 6.25 xmits of Multiscribe Reverse Transcriptase, and 5 units of RJSfase 
inhibitor, in Taqman buffer A from Perkin Elmer. Reverse transcription is performed at 
5 48°C for 30 min. After denaturing at 95°C for 10 minutes, PGR is conducted for 40 

cycles at 95°C for 15 seconds and 60°C for one minute. Analysis is conducted, during the 
amplification reaction, in a 96-weIl forniat on an ABI 7700 SDS machine (PE 
BioSystems, Foster City, CA) and data is analyzed with SDS version 1 .6.3 software. 
Exemplary probes and primers for analysis of VEGF and GAPDH genes are presented in 
10 Table 2. 



Table 2: Primer and Probe sequences for hydrolyzable probe analysis 



G^ene 


Forward primer 


Reverse primer 


Probe 


VEGF 


5'-CTG<jTAGCGGGG 
AGGATCG-3' 
(SEQ ID NO: 32) 


5'-GCCACGACCTCCG 

AGCTAC-3' 

(SEQ ID NO: 33) 


5'-CTACCCGGCTGC 
CCCAAGCCTC-3' 
(SEQ ID NO: 34) 


GAPDH 


5 '-CCTTTTGCAGACC 
ACAGTCCA-3' 
(SEQ ID NO: 35) 


5'-GCAGGGATGATGT 
TCTGGAGA-3' 
(SEQ ID NO: 36) 


5 '-CACTGCCACCCA 
GAAGACTGTGG-3' 
(SEQ ID NO: 37) 



Protein analysis. Analysis of protein levels is performed by resolving 10 |Lig of 
whole cell lysate on a 10-20% polyacrylamide gel run in Tris/glycine/SDS buffer 

15 (BioRad, Hercules, CA). Protekis separated in the gel are transferred onto a 

nitrocellulose membrane using Tris/glycine/SDS buffer supplemented with 20% 
methanol, and the filter is blocked with 5% non-fat dry milk for 1 hour at room 
temperature. The blot is probed for 1 hoiu: at room temperature witii anti-Flag M2 
monoclonal antibody (Sigma, St. Louis, MO) diluted 1: 1000 in 5% (w/v) non-fat dry 

20 milk/0. 1 % PBS-Tween^ then washed twice for 5 sec and once for 1 5 min with 0. 1 % PBS- 
Tween, All washes are performed at room temperature. The blot is then incubated for 
one hour at room temperature with a horseradish peroxidase-conjugated anti-mouse 
antibody (Amersham-Pharmacia Biotech, Piscataway, NJ), used at a 1 :3G00 dilution in 
5% (w/v) non fat dry milk /0. 1% PBS Tween. This is followed by two 5 sec washes and 

25 one 15 min wash with 0,1% PBS-Tween. Protein bands are detected using the ECL 
system (Amersham-Pharmacia Biotech, Piscataway, NJ). 
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For analysis of protein level by ELIS A, cell lysates are prepared (as described 
above) or culture medium is harvested and analyzed using a commercially available 
ELISA kit. For example, levels of secreted VEGF protein are determined by assay of 
culture medium using a human VEGF ELISA kit (R & D systems, Minneapolis, MN). 
5 Results, Transfection of the Vegl/hBAF155 fiision construct (Example 2) into 

cultmred HEK 293 cells results in activation of VEGF gene expression, compared to 
untransfected cells, as evidenced by increases in VEGF mRNA and protein levels. 
Vectors lacking the ZFP and/or BAF155 portions of the fusion are used as controls. 
Transfection efficiency is measured by co-transfection of a green fluorescent protein 
10 expression vector. A mock transfection control is also carried out. 

Introduction of the Veg3a/1-MBD1 or Veg3a/1"MBD2 fusion comiract into 
cultured HEK 293 cells by transfection results in repression of VEGF gene expression, 
compared to mtransfected cells, as evidenced by decreases in VEGF mRNA and protein 
levels. Controls similar to those described above are also conducted, 

15 

Example 8: Assays for chromatin remodeling complexes 

Methods for the purification, assay and characterization of various chromatin 
remodeling complexes are well-known to those of sldll in the art. See, for example^ 
Methods in Enzymology^ Vol. 304, "Chromatin" (P.M. Wassarman and A. P. Wolffe, 

20 eds.), Academic Press, San Diego, 1999; and Methods in Molecular Biology, Vol. 1 19, 
"Chromatin Protocols" (P.B. Becker, ed.) Humana Press, Totowa, 1999. 

Chromatin remodeling can take the form of, for example^ deposition^ removal or 
repositioning of nucleosomes within chromatin. Means for detecting chromatin 
remodeling iaclude, but are not limited to, detecting changes in accessibility of specific 

25 sites in chromatin to sequence-specific nucleases such as restriction enzymes, 

determination of the appearance or disappearance of a regularly repeating pattem of 
chromatin digestion by non-sequence speciJBc endonucleases such as micrococcal 
nuclease and DNase I, determination of nucleosome spacing, and nucleosome-binding 
assays. Also, as mentioned supra, chromatin remodeling complexes possess ATPase 

30 activity; ttierefore ATP hydrolysis assays can be used in the identification and/or 
characterization of chromatin remodeling complexes. 
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Restriction endonuclease accessibility assays are described by Logie et aL, supra 
and Varga-Weisz et al (1999) Meth, Enzymology 304:742-757, Assays for nucleosome 
spacing, DNase I accessibility, ATPase activity and nucleosome binding are disclosed by 
Varga-Weisz al, supra. Assays to detect facilitation of transcription factor binding are 
5 described by Cote et al (1994) Science 265:53-60 and Kwon et al (1994) Nature 
370:477-481 . Assays for nucleosome repositioning (i.e., "sliding") are disclosed by 
Hamichee^^/. (1999) Ce// 97:833-842. 

These assays, and others, can be used for the purification and characterization of 
chromatin remodeling complexes from various species, for example, the yeast SWI/SNF 
10 complex (Logie et al, supra\ the Drosophila CHRAC complex (Varga-Weisz et al, 
supra) and the Drosophila NURP complex (Sandaltzopoulos et ah, supra). 

Example 9: ATPase assay 

Chromatin remodeling complexes utilize the energy of ATP hydrolysis to modify 

15 chromatin structure. Consequently, nucleosome- or DNA-dependent ATPase activity can 
be used to assay for a chromatin remodeling complex- 
Methods and compositions for conducting ATPase assays are well-known to those 
of skill in the art. One measure of ATPase activity is the release of labeled 
pyrophosphate from y-^^P-labeled ATP. Release is measured as the amount of 

20 radioactivity that does not bind to activated charcoal in 2QmM phosphoric acid 

An alternative method for measuring pyrophosphate release is to measure labeled 
pyi*ophosphate directly by thin layer chromatography. The reaction mixture contains 
0.02 ^ig/ml DNA (or reconstituted nucleosomal array, see Example 11 infra), 5 iilVI 
SWI/SNF complex (or any other known or putative chromatin remodeling complex), 

25 20 mM Tris, pH 8.0, SmMMgCk, 0.2 mM dithiothreitol, 0,l%Tween, 5% glycerol, 
100 |Lig/ml bovine serum albumin, 100 fiM ATP, and 0.2 |LiCi (y-^^P)ATP (3 Ci/mmol) in 
a final volume of 20 |li1 and is incubated at 37°C. At the conclusion of the assay (imder 
these conditions the reaction rate is linear for 5-10 minutes), 1 |ll1 is pipetted onto a 
polyethyleneimine cellulose sheet and the sheet is developed in a solution of 0.75 M 

30 potassium phosphate, pH 3.5. In this system, ATP and pyrophosphate are clearly 
resolved from each other and from the origin. Quantitation is carried out either by 
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autoradiography followed by excision and scintillation cotinting of labeled spots^ or by 
phosphorimaging. 

The preceding naethods are adapted from those described by Logie et aL (1999) 
Metk Enzynnology 304:726-741. 
5 An alternative solvent for thin-layer chromatography is 0.5 M LiCl/1 M formic 

acid. In this system, pyrophosphate is separated from unhydrolyzed ATP, which remains 
at the origm. Varga-Weisz etal (1999) Meth, Enzymol 304:742-757. 

Example 10: Preparation of reconstituted nucleosome arrays 

10 Deposition of purified histone octamers onto a specific template under defined 

conditions can generate a nucleosomal array in which the positions of one or more 
individual nucleosomes, with respect to the nucleotide sequence of the template, are 
known. Such an array can be used as a substrate in an assay for chromatin remodeling 
activity, by testing for changes in nucleosome position with respect to nucleotide 

1 5 sequence. One such test is restriction endonuclease accessibility- See infra. 

Preparation of reconstituted nucleosome arrays can be conducted according to 
Logie et al^ supra and Varga-Weisz et al^ supra. Additional methods can be found in 
Methods in Enzymology, Vol. 304, "Chromatin" (P.M. Wassaiman and A. P. Wolffe, 
eds.). Academic Press, San Diego, 1999; and Methods in Molecular Biology^ Vol. 119, 

20 "Chromatin Protocols" (P.B. Becker, ed.) Humana Press, Totowa, 1999. 

Example 11: Construction of a gene encoding a fusion between a ZFP 
binding domain and the ISWI chromatin remodeling ATPase 

IS WI- encoding sequences were amplified from a recombinant plasmid encoding 
25 DrosophilalSWl. Coronae^^/. (1999) Mo/. Ce// 3:239-245. One of the primers 

contained, outside of the IS Wl-complementary region, sequences encoding a FLAG 
epitope and, at the 5' terminus, a 5' extension encoding Hind III and Xba I sites. The 
other primer contained a 5'extension encoding a Bam HI site. The sequences of the 
primers were as follows: 
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cgat cGGATCCTCCAAAACAGATACAGCTGCC 

BamHI ISWI seq 

(SEQIDNO:38) 

cratCQCC TCTAGA CTCGA GAAGCTTA CTTGTCATCGTCGTGCTTGTAGTGGCTGC CCTTCTTCTTCTTTTTCGAGTT 
Xbal Hindlll FLAG sequence ISWI seq 

(SEQ ID NO: 39) 

Amplification was conducted at 95^C for 2 min, followed by 30 cycles of 95^C 
for 30 sec, 55^C for 30 sec, 72°C for 5 min, and a final st^ of 72^C for 5 min. This 
resulted in the generation of an amplification product comprising ISWI- and FLAG- 
encoding sequences flanked by Bam HI and Hind HI sites. The amplification product 
was purified using a PGR Cleanup Kit (Qiagen, Valencia, CA) according to the 
manufacturer's instructions, then digested witili Bam HI and Hind III. 

A vector encoding a nuclear localization signal (NLS), a ZFP binding domain 
targeted to the human erythropoietin gene (Epo 2C), a VP16 activation domain and a 
FLAG epitope was digested with Bam HI and Hind III to release VP 16- and FLAG- 
encoding sequences. The Bam Hi/Hind HI fi-agment described in the preceding 
paragraph was ligated to the vector backbone to generate a vector encoding a fusion 
protein comprising a NLS, the Epo2C binding domain, ISWI and FLAG. A vector 
encoding a protein that is identical, except for the presence of an Epo3B binding domain 
in place of Epo2C, was constructed by similar methods for use as a control. The 
nucleotide sequences of the target sites, and the amino acid sequences of the recognition 
helices (-1 through 4-6) for the Epo2C and Epo3B binding domains are provided m 
Tables. 



Table 3: Target sites and recognition helix sequences for Epo2C and Epo3B 



ZFP 


Target 


Fl (-1 to +=6) 


F2 (-1 to +=6) 


F3 (-1 to +=6) 


Epo2c 


GGTGAGGAGT 
(SEQ ID NO: 40) 


RSDNALR 
(SEQ ID NO: 41) 


RSDNLAR 
(SEQ ID NO: 42) 


DSSKLSR 
(SEQ ID NO: 43) 


Epo3b 


GCGGTGGCTC 
(SEQ ID NO: 44) 


QSSDLTR 

(SEQ ID NO: 45) 


RSDALSR 
(SEQ ID NO: 46) 


RSDERKR 
(SEQ ID NO: 47) 
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Example 12: Activation of EPO expression by fusion of SRC-1 to a zinc 
finger binding domain 

The steroid receptor coactivator 1 (SRCl) protein is a histone acetyltraiisferase 
which is capable of recruiting the p300 and CBP proteins (both of which are also histone 
5 acetyltransferases). Liu et al (1999) Proc. Natl Acad. Set USA 96:9485-9490; 
Sheppard 6/ a/. (2001) Mo/. Cell Biol 21:39-50 and references cited therein) A 
construct encoding a portion of SRCl common to the a and e isoforms (amino acids 781 
through 1385, Kalkhoven et al (1998) EMBOJ, 17:232-243), fused to a zmc finger 
binding domain targeted to the human erythropoietin (EPO) gene, was constructed as 
10 follows. 

A plasmid encoding SRCl was used as a template for PGR amplification using 
the following primers, and the amplification product was digested with Not L 
5'-GGATCCGGCCACCGCGGCCGCATGGATCCATGTAATACAAACCCAACC 
(SEQIDNO: 48) 

15 5'-ATGAATTCGCGGCCGCCCTGGGTTCCATCTGCTTCTGTTTTGAG 
(SEQIDNO: 49) 

Tlie pVP16-BPOZFP-862c vector, containing a transcription unit encoding a 
nuclear localization signal (NLS), the EPO ZFP-862 zinc finger binding domain, a VP 16 
transcriptional activation domain and a FLAG epitope, under transcriptional control of a 

20 CMV promoter and a bovine growth hormone polyadenylation signal, was digested with 
Not I to release VP16-encoding sequences. See Zhang et al (2000) J, Biol Chem. 
275:33,850-33,860 for the design and properties of EPOZFP-862, which binds to a site 
862 nucleotides upstream of the EPO transcriptional startsite. The Not I-digested 
amplification product described in tiie previous paragraph was inserted into the ZFP-862c 

25 vector backbone by ligation, to generate a plasmid encoduig a NLS, the EPO ZFP-862 
binding domain, amino acids 781-1385 of SRCl and a FLAG epitope. The structure of 
the resnltmg construct, pSRClb-EP02c, is illustrated schematically in Figure 4. 

This construct was introduced into human HEK 293 cells by transfection (200 ug 
of plasmid plus 5 ug of Lipofectamine; Lipofectamine obtained from Gibco/Life 

30 Technologies, Gaithersburg, MD). Approximately 12 hours after exposure of cells to 
plasmid, the medium was removed and replaced with firesh DMEM supplemented with 
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10% fetal bovine serum. Twenty-four hours later, the medium was harvested and 
assayed for secreted EPO, using an erythropoietin ELISA from R&D Systems 
(Minneapolis, MN). 

Results of the assay, shown in Figure 5, indicated that transfection of 293 cells 
5 with the pSRClb-EP02c fusion plastnid activated expression of EPO, compared to cells 
transfected with a control plasmid (pcDNAS.l) not encoding such a fusion. Thus, ZFP- 
targeted binding, to the EPO gene, of a protein which is capable of chromatin remodeling 
(by virtue of its histone acetyltransferase activity) and can serve as a component of 
chromatin remodeling complexes (by virtue of its ability to bind p300 aad CBP) resulted 
10 in activation of gene expression. 

Example 13: Repression of VEGF expression by ZFP-MBD and ZFP-DNMT 

fusions 

Methyl binding domain proteins (MBDs) participate in repression of the 

15 expression of certain genes by binding to methylated cytosine residues present in CpG 
dinucleotides and recraiting chromatin remodeUng complexes to the site of binding. 
MBDs are also present as a component of certain chromatin remodeling complexes. 

DNA N-methyl transferases (DNMTs) methylate cytosine residues present in 
certain CpG dinucleotide sequences in cellular DNA. Such raethylation can lead to 

20 chromatin remodeling at or in the vicinity of the methylated sequence(s) by, for example, 
binding of one or more MBDs and concomitant or subsequent recruitment of chromatin 
remodeling complexes. The DNMTl protein can also associate with histone deacetylases 
(HDACs)^ which themselves are involved in chromatin remodeling. 

A series of ZFP-MBD and ZFP-DNMT fusions were tested for their abiHty to 

25 regulate expression of the human VEGF-A gene. Accordingly, a series of plasmids was 
constructed, in which the VEGF3a/l ZFP binding domain (see Example 3^uprd) was 
fused to MBD2b, MBDS, MBD3S, MBD3L, DNMTl, DNMTSa or DNMTSb. See, for 
example, GenBanlc accession numbers AF072243, AF170347, AW872007, and 
NM013595. The fusion genes also comprised a nuclear localization signal and a FLAG 

30 epitope, similar to the constructs described in Examples 1 1 and 12. Figure 6 shows a 
schematic diagram of these constructs. 
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HeLa cells were transfected with the constructs shown in Figvire 6. Seventy-two 
hours after transfection, secreted VEGF levels were measured using a VEGF ELISA 
(R&D Systems, Minneapolis, MN) according to the manufacturer's instructions. Cells 
were co-transfected with a green fluorescent protein-encoding plasmid to allow 
measurement of transfection efficiency. The results, presented in figure 7, show that 
transfection of HeLa cells with all of the MBD and DNMT fusions tested resulted in 
repression of VEGF expression. When corrected for transfection efficiency 
(approximately 50% in this experiment), intracellular expression of the 
MBD2b-VEGF3a/l fusion resulted in essentially 100% repression of VEGF expression. 
Thus, fusions between a targeted ZFP binding domain and proteins whose mechanism of 
modulating gene expression involves chromatin remodeling are able to repress gene 
expression. 

Although disclosure has been provided in some detail by way of illustration and 
example for the purposes of clarity of understanding, it will be apparent to those skilled 
in the art that various changes and modifications can be practiced without departing from 
the spirit or scope of the disclosure. Accordingly, the foregoing descriptions and 
examples should not be construed as limiting. 
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CLAIMS 

What is claimed is: 

1p a method foi' modifying a region of interest in cellular chromatin, the 
method comprising the step of contacting the cellular chromatui with a fusion molecule 
5 that binds to a binding site in the region of interest, wherein the fusion molecule 

comprises a DNA binding domain and a component of a cliromatin remodeUng complex 
or functional fragment thereof, thereby modifying the region of interest. 

2. The method of claim 1, wherein the cellular chromatin is present in a plant 

cell, 

10 3. The method of claim 1 , wherein the cellular chromatin is present in an 

animal cell. 

4. The method of claim 3, wherein the cell is a human cell. 

5, The method of any of claims 1 to 4, wherein the fusion molecule is a 
fusion polypeptide. 

15 6. The method of any of claims 1 to 5, wherein the DNA-binding domain 

comprises a zinc fmger DNA-binding domain. 

7. The method of any of claims 1 to 5, wherein the DNA-binding domain is a 
triplex-forming nucleic acid or a minor groove binder. 

8. The method of any of claims 1 to 7, wherein the component of a 
20 chromatm remodeling complex or fimctional fragment thereof is an enzymatic 

component. 

9. The method of any of claims 1 to 7, wherein the component of a 
chromatin remodeling complex or functional fragment thereof is a non-enzymatic 
component, 

25 10. The method of any of claims 1 to 9, wherein chromatin modification 

facilitates detection of a sequence of interest. 

!!• The method of claim 10, wherein the sequence of interest comprises a 
single nucleotide polymorphism. 
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12. The method of any of claims 1 to 1 1 , wherein chromatin modification 
facilitates activation of a gene of interest. 

13. The method of any of claims 1 to 11, wherein chromatin modification 
facilitates repression of a gene of interest. 

5 14. The method of any of claims 1 to 1 1 , wherein chromatin modification 

facilitates recombination between an exogenous nucleic acid and cellular chromatin. 

15. The method of claim 5, wherein the method further comprises the step of 
contacting a cell with a polynucleotide encoding the fusion polypeptide, wherein the 
fiision polypeptide is expressed in the cell. 

10 16. The method of any of claims 1 to 15, further comprising the step of 

identifying an accessible region in the cellular chromatin, wherein the fusion molecule 
binds to a target site in the accessible region. 

17. The method of any of claims 1 to 16, wherein the region of interest 
comprises a gene. 

15 18. The method of claim 17, wherein the gene encodes a product selected 

from the group consistmg of vascular endothelial growth factor, er5l;hropoietm, androgen 
receptor, PPAR-y2, pl6, p53, pRb, dystrophbi and e-cadherin. 

19. The method of any of claims 1 to 18, further comprising the step of 
contacting the cellular chromatin with a second molecule. 

20 20. The method of claim 1 9, wherein the second molecule is a transcriptional 

regulatory protein. 

21. The method of claim 19, wherein the second molecule is a fusion 
molecule. 

22. The method of claim 21, wherein the second molecule is a fusion 
25 polypeptide. 

23. The method of claim 21, wherein the second molecule comprises a zinc 
finger DNA-binding domain. 
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24. The method of clami 23, wherein the second molecule further comprises a 
transcriptional activation domain* 

25- The method of claim 23, wherein the second molecule farther comprises a 
transcriptional repression domain. 

26. The method of claim 23, wherein the second molecule further comprises a 
polypeptide sequence selected from the group consisting of a histone acetyl transferase, a 
Mstone deacetylase, a functional fragment of a histone acetyl transferase, and a functional 
fragment of a histone deacetylase. 

27. The method of claim 19, further comprising the step of contacting the 
cellular chromatin with a third molecule. 

28. The method of claim 27, wherein the third molecule is a transcriptional 
regulatory protein. 

29. The method of claim 27,wherein the third molecule is a fusion molecule. 

30. The method of claim 29,wherein the third raolecule is a fusion 
polypeptide. 

3 1 . Tlie method of claim 29^, wherein the third molecule comprises a zinc 
finger DNA-binding domain* 

32. The method of claim 31, wherein the third molecule further comprises a 
transcriptional activation domain. 

33. The method of claim 3 1 , wherein the third molecule further comprises a 
transcriptional repression domain. 

34. A fusion polypeptide comprising: 

a) a DNA binding domain; and 

b) a component of a chi-omatin remodeling complex or a functional fragment 
thereof 

35. The polypeptide of claim 34, wherein the DNA-binding domain is a zinc 
finger DNA binding domain. 
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36. The polypeptide of claim 34 or claim 35, wherein tlie DNA biading 
domain binds to a target site in a gene encoding a product selected from the group 
consisting of vascular endothelial growth factor, erythropoietin, androgen receptor, 
PPAR-Y25 pi 6, p53, pRb, dystrophin and e-cadherin. 

5 37- The polypeptide of any of claims 34 to 36, wherein the component of a 

chromatin remodeling complex or flmctional fragment thereof is an enzymatic 
component. 

38. The polypeptide of any of claims 34 to 36, wherein the component of a 
chromatin remodeling complex or functional fragment thereof is a non-enzymatic 

10 component. 

39. ^ The polypeptide of claim 37, wherein the enzymatic component of a 
chromatin remodeling complex or functional fragment thereof is selected from the group 
consisting of a S WI/SNF complex family member^ an complex family member, an 
ISAVI complex family member, a BRM family member, a BRG/BAF complex family 

15 member, a Mot-1 complex family member, a Chd-1 family member, a Chd-2 family 
member, a Chd-3 family member, a Chd-4 family member, a histone acetyl transferase 
and a histone deacetylase. 

40. A polynucleotide encoding the fusion polypeptide of any of claims 34 to 

39. 

20 41 . A cell comprising the fusion polypeptide of any of claims 34 to 39. 

42. A cell comprising the polynucleotide of claim 40. 

43. A method for modulating expression of a gene, the method comprising the 
steps of: 

a) contacting cellular chromatin with a first fusion molecule that binds to a 
25 binding site in cellular chromatin, wherein the binding site is in the gene and wherein the 
first fusion molecule comprises a DNA-binding domain and a component of a chromatin 
remodeling complex or functional fragment thereof; and 
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b) further contacting the ceUular chromatin with a second molecule that binds to a 
target site in the gene and modulates expression of the gene. 

44- The method of claim 43, wherein modulation comprises activation of 
expression of the gene. 

5 45. The method of claim 43, wherein modulation comprises repression of 

expression of the gene. 

46. The method of any of claims 43 to 45, wherein the DNA-binding domain 
of the first fusion molecule comprises a zinc finger DNA-binding domain. 

47. The method of any of claims 43 to 46, wherein the second molecule is a 
10 polypeptide. 

48. The method of claim 47 wherein tlie second molecule comprises a zinc 
finger DNA-binding domain. 

49. The method of claim 48, wherein the second molecule further comprises 
an activation domain. 

15 50. The method of claim 48, wherein the second molecule further comprises a 

repression domain. 

51. The method of claim 43 wherein the second molecule is a transcription 

factor. 

52. The method of claim 5 1, wherein the transcription factor is aa exogenous 
20 molecule. 

53. The method of claim 51, wherein the transcription factor is an endogenous 
molecule. 

54. The method of claim 43 wherein the first fusion molecule and the second 
molecule each comprise a zinc finger DNA-binding domain. 

25 55. The method of any of claims 43 to 54, wherein a plurality of first flision 

molecules is contacted with cellular chromatin, wherein each of the first fusion molecules 
binds to a distinct binding site. 
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56. The method of any of claims 43 to 55, wherein a plurality of second 
molecules is contacted with cellular chromatin, wherem each of the second molecules 
binds to a distinct target site. 

57. The method of claim 55, wherein at least one of the first fusion molecules 
5 comprises a zinc finger DNA-binding domain. 

58. The method of claim 56, wherein at least one of the second molecules 
comprises a zinc finger DNA-binding domain. 

59. The method of any of claims 43 to 58, wherem the expression of a 
plurality of genes is modulated. 

1 0 60. The method of claim 59 wherein a plurality of first fixsion molecules is 

contacted with cellular chromatin, wherein each of the first fiision molecules binds to a 
distinct binding site. 

61. The method of claim 60 wherein at least one of the first fusion molecules 
is a zinc finger fusion polypeptide. 

1 5 62. The metiiod of claim 59, wherein a plurality of second molecules is 

contacted with cellular chromatin, wherein each of the second molecules binds to a 
distinct binding site. 

63. The method of claim 62 wherein at least one of the second molecules is a 
zinc finger fusion polypeptide. 

20 64. The method of claim 59 wherein the first fusion molecule binds to a 

shared binding site in two or more of the plurality of genes. 

65. ThG method of claim 64 wherein the first fusion molecule is a zinc finger 
fiision polypeptide* 

66. The method of claim 59 wherein the second molecule binds to a shared 
25 target site in two or more of the plurality of genes. 

67. The method of claim 66 wherein the second molecule is a zinc fmger 
fiasion polypeptide. 
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68. The metliod of claim 1, wherein chromatin modiJ&cation results in the 
generation of an accessible region in the cellular chromatin 

69. The method of claim 68, wherein generation of the accessible region 
facilitates binding of an exogenous molecule. 

70. The method of claim 69, wherein the exogenous molecule is selected from 
the group consisting of polypeptides, nucleic acids, small molecule therapeutics, minor 
groove binders, major groove binders and intercalators. 

71. A method for producing a fusion polypeptide, wherein the fusion 
polypeptide comprises a zinc finger DNA binding domain and a component of a 
chromatin remodeUng complex or a functional fragment thereof, the method comprising 
the step of expressing the polynucleotide of claim 40 in a suitable host cell. 

72. A method for binding an exogenous molecule to a binding site, wherein 
the binding site is located withhx a region of interest in cellular chromatin, wherein the 
method comprises: 

(a) contacting cellular cliromatin with a fusion molecule that binds to a 
binding site in the region of interest, wherein the fusion molecule comprises a DNA 
binding domain and a component of a chromatin remodeling complex or ftinctional 
fragment thereof, thereby modifying cellular chromatin within the region of interest; and 

(b) introducing the exogenous molecule into the cell; 
whereby the exogenous molecule binds to the binding site. 
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FIGURE 6 
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